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Introduction 


The  BRCAl  gene  was  mapped  to  chromosome  17q  in  1990  by  Mary-  Claire  King  and 
colleagues.  Shortly  thereafter.  Dr.  Narod  was  able  to  confirm  that  BRCAl  was  the  gene  for 
the  breast-ovarian  cancer  syndrome  as  weE  From  1990-1994,  a  series  of  experiments  in 
several  laboratories  confirmed  that  BRCAl  was  the  gene  responsible  for  the  majorify  of 
families  with  multiple  cases  of  breast  and  ovarian  cancer.  The  risk  of  breast  cancer  for 
women  who  carry  a  mutant  copy  of  this  gene  is  87%  to  age  70.  The  risk  of  ovarian  cancer  is 
50% .  The  purpose  of  the  present  study  is  to  clone  the  BRCAl  gene  and  to  identify  the  range 
of  mutations  present  in  the  families  with  the  breast  ovarian  cancer  syndrome. 


The  objectives  of  the  study  were  stated  as  follows: 


1)  To  identify  highly  informative  polymorphic  markers  in  the  region  of  BRCAl. 

2)  To  fype  linked  breast  and  ovarian  cancer  families  with  these  markers  and  to  identify 
all  of  the  crossovers  which  provide  information  about  the  minimiuii  region  of 
chromosome  17  containing  BRCAl. 

3)  To  produce  a  physical  map  of  the  minimum  region  by  identifying  a  series  of 
overlapping  contiguous  fragments  of  YAC  DNA  inserts  that  span  the  region. 

4)  To  identify  the  coding  sequences  in  the  cloned  YACs  and  to  assay  the  expression  of 
these  genes  in  normal  breast  and  ovary  and  in  tumors  derived  from  these  tissues. 
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5)  To  test  genes  identified  to  be  in  the  minimum  region  for  rearrangements  and  for 
point  mutations  tiiat  may  be  associated  with  cancer  predisposition  in  the  families. 

6)  To  sequence  the  BRCAl  gene  in  our  panel  of  100  hereditary  breast  and 
breast-ovarian  cancer  families  and  to  identify  the  range  of  mutations  in  this  gene. 

7)  To  evaluate  the  association  between  particular  mutations  and  cancer  patterns  in  the 
panel  of  famities 

8)  To  sequence  the  BRCAl  gene  in  the  constitutional  DNA  of  panel  of  women  with 
apparently  sporadic  breast  and  ovarian  cancers. 

9)  To  sequence  flie  BRCAl  gene  in  the  tumor  DNA  of  panel  of  women  with 
apparently  sporadic  breast  and  ovarian  cancers. 

Objectives  1  to  5  have  been  accomplished  throu^  a  collaboration  of  Dr.  Narod  and  other 
researchers. 

Objective  6  has  been  reached  for  60  families  and  should  be  completed  by  July  1, 1995 

Objective  7  is  in  progress.  Early  results  do  not  support  the  hypothesis  that  different 
mutations  are  associated  with  different  phenotypes.  Rather,  it  appears  that  the  other  genes 
wiU  be  more  important  in  modifying  genetic  risk. 

Objective  8  is  in  progress  for  two  groups  -  Ashkenazi  Jewish  women  and  French  Canadian 
women. 
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Objective  9  has  been  studied  by  others.  It  is  found  that  mutations  in  BRCAl  are  not 
frequent  in  sporadic  breast  and  ovarian  tumours.  The  great  majority  of  mutations  detected 
have  been  germline  (hereditary)  mutations.  This  objective  will  not  be  pursued  further. 

Details  of  the  progress  of  these  individual  objectives  follows. 
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Body 


6.0  Overview.  The  goal  of  the  current  study  was  to  identify  the  BRCAl  gene  through 
positional  cloning.  The  approach  involved  a  combination  of  genetic  mapping  and  physical 
mapping,  leading  to  the  identification  of  candidate  genes  from  chromosome  17q.  Through 
a  collaborative  effort  the  BRCAl  gene  was  cloned  in  October  1994  (Miki  et  al,  1994).  Dr. 
Narod  was  a  contributing  partner  in  this  collaboration. 

The  cloning  of  the  BRCAl  gene  took  place  in  year  one  of  the  grant  period  of  Dr 
Narod.  Dr.  Narod  has  capitalized  on  the  early  results  and  has  gone  on  to  use  these  results 
to  investigate  several  aspects  of  the  population  genetics  of  BRCAl,  including;  establishing 
the  range  of  BRCAl  mutations  in  breast-ovary  famiHes;  2)  establishing  mutation/haplotype 
correlations;  3)  establishing  the  existence  of  predominant  mutations  in  ethnic  subgroups;  4) 
establishing  the  proportion  of  breast  cancer  families  due  to  BRCAl;  5)  establishing  genetic 
and  non-genetic  modifiers  of  BRCAl  penetrance. 

This  body  of  work  has  contributed  to  the  completion  of  12  papers  (9  published,  1  in 
press,  2  submitted).  These  papers  are  listed  in  the  References  below  and  a  copy  of  each  is 
provided  in  the  Appendix.  Dr.  Narod  is  the  first  author  on  three  papers,  is  the  senior  author 
on  four  other  papers  and  is  a  contributing  author  on  the  five  remaining  papers. 

6.1  Genetic  mapping  of  BRCAl.  In  1994  the  region  of  assignment  was  established  to  be  an 
interval  of  1.2  kb  surrounding  the  gene  for  estradiol  dehydrognease  (EDH17B).  This  was  a 
key  candidate  gene  and  had  been  tentatively  ruled  out  by  direct  sequence  analysis,  but  not 
by  linkage  analysis.  By  examining  a  genetic  recombinant  in  a  45-year  old  woman  in  a 
family  with  10  cases  of  early-onset  breast  cancer  and  a  single  case  of  ovarian  cancer  it  was 
possible  to  map  the  BRCAl  gene  distal  to  the  EDH17B  locus  (Tonin  et  al,  1994).  This 
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became  the  proximal  boundary  for  BRCAl.  Through  a  collaboration  with  the  University  of 
Utah,  a  distal  boundary  for  the  BRCAl  interval  (D17S78)  was  established.  These  two 
markers  formed  the  genetic  interval  of  assignment  of  BRCAl,  a  region  of  bOOkb. 

6.2  Physical  mapping  of  BRCAl.  At  the  same  time  as  the  construction  of  the  genetic  map, 
a  physical  map  of  cloned  DNA  fragments  (YACS)  was  tmder  construction  in  the  laboratory 
of  Dr  Narod,  in  collaboration  with  Dr.  J.  Rommens  of  the  Hospital  for  Sick  Children 
Toronto  for  the  region  corresponding  to  the  genetic  map  (Rommens  et  al,  1995).  This 
physical  map  was  used  to  identify  30  unique  cDNA  fragments,  by  hybridization  of  RNA 
hbraries  to  immobilized  YACs.  These  30  clones  were  aligned  to  form  ten  transcription 
units.  Several  known  and  unknown  genes  were  mapped  by  this  process,  including 
gamma-tubuhn  and  the  Ki  antigen  gene.  None  of  these  30  clones  was  the  BRCAl  gene.  The 
construction  of  the  physical  map  using  the  YAC  Hbrary  took  place  in  the  laboratory  of  Dr 
Narod,  the  retrieval  of  gene  fragments  took  place  in  flie  laboratory  of  Dr.  Rommens  and  the 
sequencing  of  the  fragments  took  place  in  the  laboratory  of  Dr.  Jacques  Simard. 

6.3  Identification  of  BRCAl  mutations.  The  BRCAl  gene  was  identified  in  September 
1994.  A  number  of  mutations  were  identified  in  one  of  the  candidate  genes  by  sequencing 
of  tumor  DNA  and  constitutional  DNA,  in  the  laboratory  of  Mark  Skolnick  and  Myriad 
Genetics,  Salt  Lake  City,  Utah.  By  using  the  primer  sequences  for  the  23  BRCAl  coding 
exons  it  was  then  possible  to  screen  individuals  for  germline  mutations  in  the  BRCAl  gene. 

A  total  of  30  Canadian  hereditary  breast  cancer  families  were  selected  from  over  100 
familjpR  collected  by  Dr.  Narod  for  the  initial  panel  chosen  for  sequence  analysis.  Ten  of 
these  famihes  were  sequenced  in  the  laboratory  of  Myriad  genetics  (Utah)  and  20  families 
were  sequenced  in  Quebec  by  the  team  of  Dr  Narod,  including  Dr.  Jacques  Simard  in 
Quebec  cify.  The  PCR  amplified  fragments  were  sequenced  in  both  the  coding  and 
non-coding  orientations.  This  approach  guaranteed  the  greatest  degree  of  mutation 
sensitivity,  which  was  essential  at  the  early  stages  of  mutation  detection.  A  total  of  14 
BRCAl  mutations  were  identified  in  the  30  famihes  (Simard  et  al,  1994).  This  experiment 
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was  noteworthy  in  that  the  first  examples  of  recurrent  mutations  were  identified  in  the 
Canadian  families.  In  four  families  the  mutation  185  del  (AG)  was  foxmd.  This  is  now  the 
most  commonly  reported  BRCAl  mutation.  In  another  four  famiHes  a  second  mutation, 
5728  ins  C,  was  found.  By  creating  genetic  haplotypes  it  was  evident  that  the  common 
mutations  were  seen  in  the  context  of  a  common  haplotype.  This  implies  that  the  patients 
with  these  mutatioiis,  although  previously  believed  to  have  been  unrelated,  have  a 
common  ancestor.  A  second  set  of  Canadian  families  has  now  been  screened  for  mutations. 

In  this  set  of  30  breast  cancer  famiHes  an  additional  four  mutations  have  been  found 
(manuscript  in  preparation).  Three  of  these  mutations  have  been  seen  before.  The  other  is  a 
novel  mutation  which  appears  to  be  unique  to  French-Canadians.  In  total,  18  BRCAl 
mutations  have  been  identified  in  60  Canadian  famiHes. 

In  additional,  a  further  20  breast-ovarian  cancer  famiHes  from  Creighton  University 
have  been  characterized  by  a  combination  of  Hnkage  analysis,  haplotype  analysis  and  direct 
sequencing  in  collaboration  with  Dr.  Heniy  Lynch  (Omaha,  Nebraska)  and  Dr.  Gilbert 
Lenoir  (Lyon,  France).  These  famiHes  have  been  ascertained  by  Dr.  Lynch.  The  Hnkage 
and  haplotype  analysis  was  performed  by  Dr.  Narod  in  Montreal.  The  sequencing  for 
BRCAl  mutations  was  done  by  Dr.  Lenoir  in  Lyon.  There  have  been  16  BRCAl  mutations 
identified  in  the  20  famiHes  from  Creighton  University.  These  famiHes  are  the  subject  of  a 
separate  report  (Serova  et  al,  submitted)  which  is  included  in  the  Appendix.  It  was 
noteworthy  that  in  six  of  the  16  famiHes  there  was  a  reduction  in  the  amount  of  BRCAl 
RNA  in  the  lymphoblastoid  cells.  This  impHes  that  the  presence  of  a  mutation  (most 
frequently  a  nonsense  or  frameshift  mutation)  leads  to  RNA  instabiHty  by  a  mechanism  that 
is  not  yet  understood. 

The  Hnkage  and  mutation  data  tiiat  is  generated  for  the  Creighton  famiHes  is  used 
for  a  genetic  counselling  evaluation  project  that  is  also  funded  by  the  Department  of  the 
Army  (Car)^  Lerman  PI).  Dr.  Narod  provides  the  risk  assessment  data  used  for  this  study 
and  has  to  date  provided  DNA-based  risk  assessments  to  over  250  individuals  in  these  20 
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families.  The  funds  for  the  laboratory  component  of  this  study  are  derived  from  the  current 
project  The  Cloning  of  the  BRCAl  Gene.  Dr  Narod  does  not  receive  any  additional  funding 
for  this  project  from  Dr.  Lerman. 

6.4  Genetic  epidemiology  of  BRCAl.  After  the  cloning  of  BRCAl  much  of  the  effort  in 
Dr.  Narod  laboratory  focused  on  studies  of  the  range  and  frequency  of  BRCAl  mutations  in 
the  population  and  the  use  of  this  information  for  the  rapid  detection  of  mutations. 

It  was  suggested  by  the  study  of  the  first  30  Canadian  families  that  a  small  number 
of  common  mutations  could  possibly  accotmt  for  the  majority  of  BRCAl  mutations  in  the 
population  (Simard  et  al,  1994).  Eight  of  tihe  first  14  mutations  were  recurrent  In  every 
case,  a  recurrent  mutation  was  seen  to  be  associated  with  a  single  haplofype.  This 
observation  led  to  tiie  hypothesis  that  by  genotyping  a  small  number  of  markers  from  the 
region  surrounding  the  BRCAl  gene  it  would  be  possible  to  predict  the  associated  germUne 
mutation  by  inspection  of  the  segregating  haplotype.  This  would  result  in  a  great  saving  of 
effort  as  it  would  not  be  necessary  to  sequence  the  entire  gene.  To  evaluate  this  possibility. 
Dr.  Narod  is  directing  a  collaboration  to  establish  the  corresponding  haplotypes  for  200 
breast-ovaiy  cancer  families.  To  date,  over  150  different  families  have  been  identified  by 
collaborators  and  the  haplotype  analysis  of  these  is  underway  in  the  laboratoiy  of  Dr. 
Narod.  Results  are  available  on  80  families.  Early  results  indicate  that  flie  majority  of 
recurrent  mutations  are  associated  with  a  common  haplotype.  However,  the  association 
between  haplotype  and  mutation  is  imperfect  because  of  genetic  recombination,  mutation 
at  the  marker  locus,  recurrent  de  novo  mutation  and  the  presence  of  one  frequent  haplotype 
(approx  35%)  in  the  general  population.  It  is  anticipated  that  this  study  will  be  completed 
in  March  1996. 

In  the  paper  of  Simard  et  al  (1994)  there  were  four  families  reported  with  the  185  del 
AG  mutation.  It  was  noted  that  aU  of  these  are  of  Ashkenazi  Jewish  origin.  This  finding 
was  extended  to  include  two  additional  families  and  it  was  noted  that  these  were 
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Ashkenazi  as  well  (Tonin  et  al,  1995).  This  specific  association  has  now  been  seen  in  over 
twenty  famihes  worldwide.  It  has  been  found  by  another  research  group  that  this  mutation 
may  be  present  in  1%  of  Ashkenazi  Jews  and  this  story  has  been  the  subject  of  much  media 
attention  recently. 

One  of  the  critical  questions  is  to  estimate  the  proportion  of  Jewish  women  with 
breast  and  ovarian  cancer  who  are  carriers  of  the  del  AG  mutation.  To  answer  this  question 
a  study  of  300  Jewish  patients  with  ovarian  cancer  is  planned.  These  patients  will  be 
selected  from  the  list  of  aU  living  patients  with  epithelial  ovarian  cancer  at  each  of  six 
hospitals  in  Canada  and  the  USA.  Each  will  be  interviewed  about  family  history  and 
reproductive  history  and  a  blood  sample  wiU  be  obtained  to  screen  for  the  185  del  AG 
mutation.  A  control  sample  of  400  Jewish  women  will  be  interviewed  about  family  and 
reproductive  histories  (no  blood  samples  obtained  from  controls).  Dr.  Narod  has  received  a 
supplement  of  37,920  dollars  from  the  National  Action  Committee  on  Breast  Cancer  to 
perform  this  study. 

A  similar  study  has  been  started  on  French  Canadian  women.  250  French-Canadian 
women  with  ovarian  cancer  have  been  selected  from  three  Quebec  hospitals.  To  date,  160 
women  have  been  enrolled  and  blood  samples  are  now  available  on  these.  BRCAl 
mutations  will  be  sought  in  the  familial  cases.  Once  recurrent  mutations  are  found,  the 
entire  study  group  of  250  women  will  be  screened  for  the  recurrent  mutations.  The  goal  of 
this  study  is  to  identify  the  frequent  mutations  in  this  ethnic  group  in  order  to  plan  for  a 
provincial  based  screening  program  for  hereditary  breast  and  ovarian  cancer. 

Dr.  Narod  was  one  of  the  founders  of  the  Breast  Cancer  Linkage  Consortium  and  is 
one  of  the  principal  contributors.  The  goals  of  this  collaborative  group  are  to  establish  tiie 
genetic  basis  for  famihal  breast  cancer  worldwide,  to  estimate  the  proportion  of  families 
attributable  to  each  of  the  major  breast  cancer  loci,  and  to  identify  the  phenotypic 
characteristics  associated  with  each  of  the  major  genes.  Dr.  Narod  analyzed  and  prepared  a 
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report  of  data  from  145  breast-ovarian  cancer  families  on  behalf  of  the  Breast  Cancer 
Linkage  Consortium  (Narod  et  al,  1995a).  The  study  estimated  that  76%  of  breast  ovary 
cancer  families  were  attributable  to  BRCAl.  If  the  family  had  no  cases  of  male  breast  cancer 
and  two  or  more  cases  of  ovarian  cancer,  the  estimated  linked  proportion  was  92%.  After 
the  BRCAl  gene  was  cloned  and  BRCA2  was  mapped  it  was  possible  to  reanalyze  the  data 
from  the  145  famihes  (Narod  et  al,  1995b).  There  were  10  apparently  unlinked  families  in 
the  original  report  BRCAl  mutations  were  subsequently  identified  in  three  famihes  and 
the  other  seven  famihes  are  linked  to  BRCA2.  None  of  the  original  145  famihes  is 
convincingly  unlinked  to  BRCAl  and  to  BRCA2. 

Not  all  carriers  of  BRCAl  mutations  develop  cancer,  and  for  those  who  do,  the  age 
of  onset  varies.  Some  women  develop  breast  cancer  and  other  develop  ovarian  cancer.  In 
an  attempt  to  determine  the  relevant  genetic  and  non-genetic  factors  which  contribute  to  the 
clinical  expression  of  the  BRCAl  gene.  Dr.  Narod  has  collected  DNA  samples  and 
associated  chnical  information  on  over  300  female  carriers  of  BRCAl  mutations.  By  using 
of  a  historical  cohort  design,  and  a  Cox  proportional  hazards  analysis,  it  has  been  possible 
to  study  the  effect  of  modifying  genes  and  of  reproductive  factors  on  cancer  penetrance 
(Narod  et  al,  in  press,  Phelan  et  al,  submitted).  It  was  found  that  parity  is  an  important  risk 
modifier  -  each  additional  birth  decreases  the  risk  of  breast  cancer  by  15%,  but  increases  the 
risk  of  ovarian  cancer  by  40%.  However,  the  risk  of  ovarian  cancer  is  decreased  by  a  late 
birth.  There  was  a  strong  cohort  effect  present  for  both  cancer  fypes;  the  risk  of  breast  and 
ovarian  cancer  is  roughly  double  for  women  bom  after  1930  than  for  women  bom  before 
1930.  Several  genetic  polymorphisms  have  been  evaluated  to  see  if  they  modify  the 
penetrance  of  BRCAl.  The  presence  of  a  rare  allele  of  the  HRASl  polymorphism  is 
associated  with  a  2.85-fold  increase  in  the  risk  of  ovarian  cancer  in  the  cohort  of  307  BRCAl 
carries  (p  =  0.002).  There  was  no  effect  on  the  penetrance  of  breast  cancer  by  the  HRASl 
locus.  Additional  genes  that  are  being  evaluated  for  modifying  effects  on  ttiis  cohort 
include  epoxide  hydrolase  and  CSTTMl. 
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In  this  panel  of  80  families  there  was  no  significant  association  between  mutation 
type  or  mutation  position  and  the  risk  of  breast  or  ovarian  cancer.  The  mutations  in  the  3' 
end  of  the  gene  were  associated  with  a  40%  reduction  in  ovarian  cancer  risk,  but  this 
difference  was  not  significant  (p  =  0.28). 

7.  Conclusions.  Through  a  combined  linkage  and  physical  mapping  approach  Dr.  Narod 
contributed  to  the  cloning  of  fl\e  BRCAl  gene  in  late  1994.  The  identification  of  this  gene 
has  led  to  many  important  questions  regarding  the  biology  of  the  gene,  the  population 
genetics  of  BRCAl  mutations  and  clinical  management  of  women  who  are  identified  to  be 
carriers  of  BRCAl  mutations.  Dr.  Narods  efforts  are  now  focused  on  developing  rapid 
methods  for  detecting  BRCAl  mutations  in  the  population  by  taking  advantage  of  the 
haplotype  and  ethnic  associations  of  known  mutations.  Collaborative  Networks  are  being 
established  to  enhance  the  study  of  the  modifying  genes  which  may  jointly  determine  the 
range  of  expression  and  the  penetrance  of  the  gene.  This  information  will  be  useful  in 
applying  molecular  technology  to  the  screening  and  counselling  of  women  at  high  genetic 
risk  for  breast  and  ovarian  cancer. 


11 


References  pertaining  to  this  grant 


1.  Miki  et  al,  and  44  oihers,  including  Najrod  SA.  A  strong  candidate  gene  for  the 
breast  and  ovarian  cancer  susceptibility  gene,  BRCAl.  Science  (1994)  266, 66-71. 

2.  Narod  SA,  Ford  D,  Devilee  P,  Barkardottir  RB,  Lynch  HT,  Smith  SA  et  al  (1995). 

An  evaluation  of  genetic  heterogeneity  in  145  breast-ovarian  cancer  famihes.  Amer  J 
Hum  Genet  56:254-264. 

3.  Narod  SA,  Ford  D,  Devilee  P,  Barkardottir  RB,  1  Lenoir  G,  Serova  O,  Easton 

D,  Goldgar  D  (1995b)  Genetic  heterogeneity  of  breast-ovarian  cancer  revisited. 

Amer  J  Human  Genet  57:  958-960 

4.  Narod  SA  et  al.  Risk  modifiers  in  BRCAl  carriers.  International  Journal  of  Cancer 
(in  press). 

5.  Neuhausen  SL,  Swensen  J,  Miki  Y,  Liu  Q,  Tavtigian  S,  Shattuck-Eidens  D,  Kamb  D, 
Hobbs  MR,  Gingrich  J,  Shizuya  H,  Kim  U-J,  Cochran  C,  Futreal  PA,  Wiseman  RW, 
L5mch  HT,  Tonin  P,  Narod  SA,  Cannon- Albright  L,  Skolnick  MH,  and  Goldgar 
DE.  (1994)  A  Pl-based  physical  map  of  the  region  from  D17S776  to  D17S78 
containing  the  breast  cancer  susceptibility  gene  BRCAl.  Human  Molecular  Genetics 
3(11):191^1926. 

6.  Phelan  CM,  Rebbeck  TR,  Weber  BL,  Devilee  P,  L5mch  HT,  Lenoir  GM,  Stratton  MR, 
Easton  DF,  Rutdedge  MH,Larsson  C,  Goldgar  DE,  Narod  SA.  The  risk  of  ovarian 
cancer  in  BRCAl  carriers  is  modified  by  the  HRASl  locus.  Submitted  to  Nature 
Genetics 

7.  Rommens  JM,  Durocher  F,  McArthur  J,  Tonin  P,  Leblanc  J-F,  AUen  T,  Samson  C, 
Ferri  L,  Narod  SA,  Morgan  K,  and  Simard  J.  (1995)  Generation  of  a  Transcription 
Map  at  the  HSD17B  Locus  Centromeric  to  BRCAl  at  17q21.  Genomics  28, 530-542. 

8.  Serova  O,  Narod  SA,  Tonin  P,  Feunteun  J,  SyUa  B,  Lynch  HT,  Lenoir  GM.  BRCAl 
mutation  analysis  of  20  breast-ovarian  cancer  families.  Submitted  to  American 
Journal  of  Human  Genetics. 

9.  Shattuck-Eidens  D,  McClure  M,  Simard  J,  Labrie  F,  Narod  SA,  Couch  F  et  al  (1995) 

A  collaborative  survey  of  80  mutations  in  the  BRCAl  breast  and  ovarian  cancer 
susceptibility  gene.  Implications  for  presymptomatic  testing  and  screening.  J  Amer 
Medic  Assoc  273:535-541. 


12 


References  (continued) 


10.  Simard  J,  Torun  P,  Durocher  F,  Morgan  K,  Rommens  J,  Gingras  S,  Samson  Q 
Leblanc  J-F,  Belanger  Q  Dion  F,  Liu  Q,  Skolnick  M,  Goldgar  D,  Shattuck  Eidens  D, 
Labrie  F,  and  Narod  SA.  (1994)  Common  origins  of  BRCAl  mutations  in  Canadian 
breast  and  ovarian  cancer  families.  Nature  Genetics  8: 392-398. 

11.  Tonin  P,  Serova  O,  Simard  J,  Lenoir  G,  Feunteun  J,  Morgan  K,  Lynch  H,  Narod  SA. 
(1994)  The  gene  for  hereditary  breast-ovarian,  BRCAl  maps  distal  to  EDH17B2  in 
chromosome  region  17ql2-21.  Hum  Molec  Genet  3(9):1679-1682. 

12.  Tonin  P,  Serova  O,  Lenoir  G,  Lynch  H,  Durocher  F,  Simard  J,  Morgan  K,  and 
Narod  SA.  (1995)  BRCAl  mutations  in  Ashkenazi  Jewish  women.  American 
Journal  of  Human  Genetics  57:189. 


13 


Appendix 


14 


RESEARCH  ARTICLES 


t.  ■ 


A  Strong  Candidate  for  the 
Breast  and  Ovarian  Cancer 
Susceptibility  Gene  BRCA1 

Yoshio  Miki,  Jeff  Swensen,  Donna  Shattuck-Eiden^  P.  Andrew  Fu^re 
Keith  Harshman.  Sean  Ta'rt'g'af^’ 

L  Michelle  Bennett,  Wei  Ding,  Russell  Bell^  Jud  th 
rharles  Hussev  Thanh  Tran,  Melody  McClure,  Cheryl  Fiye,  Torn  Nattier, 
Robert  Phelps  Astrid  Haugen-Strano,  Harold  Katcher, 

Zoh?fGhSami,  DanS  Shaffir,  Steven  Stone,  Steven  Bayer,  Christian  \J^ray, 
Robert  Boqderi,  Priya  Dayananth,  John  Ward,  Patricia  Toniti,  Steven  Narod, 
Pam  K  Bristow  Frank  H.  Norris,  Leah  Helvering,  Paul  Momson, 

Paul  Rosteck  Mei  Lai,  J.  Carl  Barrett,  Cathryn. Lewis,  Susan  Neuhau^n, 
Lisa  Cannon-Albright,  David  Goldgar,  Roger  Wiseman,  Alexander  Kamb, 

Mark  H.  Skolnick* 

A  strong  candidate  for  the  17q-linked  BRCA1  gene,  which  influences  f^isceptibility  to 
breLt  Ind  ovarian  cancer,  has  been  identified  by  positional  cloning  methods.  Prob Jle 
predisposing  mutations  have  been  detected  in  five  of  eight  kindreds  Pr®sumed  to  seg 
rAnfltp  BRCA1  susceptibility  alleles.  The  mutations  include  an  11-base  pair  delet  o  , 
"blL  pa?!ier^on  rS  a  missense  substitution,  and  an  inferred  regulatory 

mutation^  The  BRCA1  gene  is  expressed  in  numerous  tissues,  including  breast  and  ovary, 
rd^odes  a  predicted  protein  of  1 863  amino  acids.  This  protein  contains  a  zinc  finger 
domain  in  its  amino-terminal  region,  but  is  otherwise  unrelated  to  previously  descnbed 
proteins  Identification  of  9RCA1  should  facilitate  early  diagnosis  of  breaK  and  o^n^ 
S  suSptibillty  in  some  indiyiduais  as  well  as  a  better  understanding  of  breast 

cancer  biology. 


Breast  cancer  is  one  of  the  most  common 
and  important  diseases  affecting  women. 
Current  estimates  indicate  that  one  in  eight 
American  women  who  reach  age  95  will 
develop  breast  cancer  (I).  Treatment  of 
advanced  breast  cancer  is  often  futile  and 
disfiguring,  making  early  detection  a  high 
priority  in  medical  management  of  the  dis¬ 
ease.  Ovarian  cancer,  although  less  frequent 
than  breast  cancer,  is  often  rapidly  fatal  and 
is  the  fourth  most  common  cause  of  cancer 
mortality  in  American  women. 
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Generic  factors  contribute  to  an  ill-de¬ 
fined  proportion  of  breast  cancer  incidence, 
estimated  to  be  about  5%  of  all  cases  but 
approximately  25%  of  cases  diagnosed  be¬ 
fore  age  30  (2).  Breast  cancer  has  been 
subdivided  into  two  types,  early-onset  and 
late-onset,  a  division  that  is  based  on  an 
inflection  in  the  age-specific  incidence 
curve  around  age  50.  Mutation  of  one  gene, 
BRCAi,  is  thought  to  account  for  approxi¬ 
mately  45%  of  families  with  significantly 
high  breast  cancer  incidence  and  at  least 
80%  of  families  with  increased  incidence  of 
both  early-onset  breast  cancer  and  ovarian 
cancer  (3).  Intense  efforts  to  isolate  the 
BRCAI  gene  have  proceeded  since  it  was 
first  mapped  to  chromosome  arm  17q  in 
1990  (4,  5).  A  second  locus,  BRCA2,  re¬ 
cently  mapped  to  chromosome  arm  13q  (6), 
appears  to  account  for  a  proportion  of  early- 
onset  breast  cancer  roughly  equal  to  that 
resulting  from  BRCAI.  Unlike  BRCAI, 
however,  BRCA2  may  not  influence  ovari¬ 
an  cancer  risk.  The  remaining  susceptibility 
CO  early-onset  breast  cancer  is  likely  attrib¬ 
utable  to  unmapped  genes  for  familial  can¬ 
cer  and  rare  germline  mutations  in  genes 
such  as  TP53,  which  encodes  the  tumor 
suppressor  protein  p53  (7)-  It  has  also  been 
suggested  that  heterozygote  carriers  of  de¬ 
fective  forms  of  the  gene  predisposing  co 
ataxia  telangiectasia  are  at  higher  risk  for 
breast  cancer  (8,  9).  Late-onset  breast  can- 
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cer  is  often  familial  in  origin,  although  the 
risks  in  relatives  are  not  as  high  as  those  for 
early-onset  breast  cancer  (10,  1 1 ).  The  per¬ 
centage  of  such  cases  that  are  due  to  genetic 
susceptibility  is  unknown. 

Like  many  other  gebes  involved  in  fa¬ 
milial  cancer,  BRCAI  appears  to  encode  a 
tumor  suppressor,  a  protein  that  acts  as  a 
negative  regulator  of  tumor  growth.  Can¬ 
cer-predisposing  alleles  typically  carry  mu¬ 
tations  that  cause  loss  or  reduction  of  gene 
function.  Predisposition  to  cancer  is  inher¬ 
ited  as  a  dominant  genetic  trait,  whereas 
the  predisposing  allele  generally  behaves  as 
a  recessive  allele  in  somatic  cells.  Thus,  a 
single  inherited  copy  of  the  mutant  allele 
causes  predisposition,  and  loss  or  inactiva¬ 
tion  of  the  wild-type  allele  completes  one  of 
the  steps  in  progression  toward  malignancy. 
When  chromosome  loss  is  observed  in 
breast  and  ovarian  tumors  from  patients 
who  carry  BRCAI  predisposing  alleles,  the 
wild-type  copy  of  BRCAI  is  invariably  lost 
while  the  presumptive  mutant  allele  is  re¬ 
tained  (12-14),  This  finding  supports  the 
hypothesis  that  BRCAI  is  a  tumor  suppres¬ 
sor  gene  and  suggests  that  the  functional 
BRCAI  protein  is  present  in  normal  breast 
and  ovarian  epithelium  tissue  and  is  altered, 
reduced,  or  absent  in  some  breast  and  ovar¬ 
ian  tumors. 

Genetic  analysis  of  recombinant  chromo¬ 
somes  in  members  of  large  kindreds  allowed 
localization  of  BRCAI  mitially  to  a  region  of 
1  to  2  megabases  on  chromosome  17q  (15- 
17)  and,  subsequently,  to  a  region  of  about 
600  kilobase  pairs  (kb)  (18)  between  markers 
D17S1321  and  D17S1325  (19),  A  physical 

map  comprised  of  overlapping  yeast  artificial 
chromosomes  (YACs),  PI,  bacterial  artificial 
chromosomes  (BACs),  and  cosmid  clones  was 
generated  for  this  region  (18). 

Identification  of  a  strong  BRCAI  can¬ 
didate  gene.  Several  strategies  were  used  to 
develop  a  detailed  map  of  transcripts  for  the 
600-kb  region  of  17q21  between  D17S1321 
and  DI7S1325.  Sixty-five  candidate  ex¬ 
pressed  sequences  (20)  within  this  region 
were  identified.  Expressed  sequences  were 
characterized  by  DNA  sequence,  database 
comparison,  transcript  size,  expression  pat¬ 
tern,  genomic  structure  and,  most  impor¬ 
tantly,  DNA  sequence  analysis  in  individu¬ 
als  from  kindreds  that  segregate  17q-Unked 
breast  and  ovarian  cancer  susceptibility. 
Three  expressed  sequences  eventually  were 
merged  into  a  single  transcription  unit 
whose  characteristics  strongly  suggest  that 
it  is  BRCAi  (21).  This  transcription  unit  is 
located  in  the  center  of  the  600-kb  region 
(Fig.  1)  spanning  DI7S855  and  will  be  re¬ 
ferred  to  herein  as  BRCAI . 

A  combination  of  sequences  obtained 
from  complementary  DMA  (cDNA)  clone^ 
hybrid-selected  sequences,  and  amplified 
polymerase  chain  reaction  (PCR)  products 
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allowed  the  construction  of  a  composite, 
full-length  BRCAl  cDNA.  The  cDNA 
clone  extending  farthest  in  the  3  direction 
contains  a  polyadenylate  tract  preceded  by 
a  polyadenylation  signal.  Conceptual  trans¬ 
lation  of  the  cDNA  revealed  a  single,  long 
open  reading  frame  with  a  presumptive  ini¬ 
tiation  codon  flanked  by  sequences 
bling  the  Kozak  consensus  sequence  (22). 
This  reading  frame  encodes  a  protein  of 
1863  amino  acids  (Fig.  2A).  Smith-Wa^ter- 
man  (23)  and  BLAST  (24)  searches  iden¬ 
tified  a  sequence  near  the  NH^-terminus 
that  has  considerable  similarity  to  zinc  fin¬ 
ger  domains  (25)  (Fig.  2B).  This  sequence 
contains  cystine  and  histidine  residues 
present  in  the  consensus  Cys3-His-Cys4 
(C3HC4)  zinc  finger  motif  and  shares  many 
other  residues  with  zinc  finger  proteins  in 
the  databases.  The  BRCAl  gene  is  com¬ 
posed  of  22  coding  e.xons  distributed  over 
roughly  100  kb  of  genomic  DNA  (Fig.  3). 

Hybridization  of  RNA  blots  to  labeled 
fragments  of  BRCAl  cDNA  revealed  a  sin¬ 
gle  transcript  of  7.8  kb.  This  transcript  is 
most  abundant  in  testis  and  thymus,  but  U 
also  present  in  breast  and  ovary  (Fig.  4). 
The  cDNA  clones  derived  from  the  5'  one- 
third  of  BRCAl  transcripts  display  a  com¬ 
plex  pattern  of  alternative  splicing.  Four 
alternative  splices  were  observed  down¬ 
stream  of  the  start  codon  as  independent 
cDNA  clones  (P3,  P4,  B31,  and  B21  in  Fig. 
3)-  three  of  these  patterns  were  detected  in 
breast  cDNA  (P3,  B31,  and  B21)  and  two 
in  ovary  cDNA  (P3  and  B21).  In  addition, 
PCR  analysis  of  cDNA  samples  prepared 
from  breast,  ovary,  testis,  and  lymphocyte 
messenger  RNA  (mRNA)  indicates  Aat 
there  is  considerable  heterogeneity  in  splice 
junction  usage  near  the  5'  end  of  BRCAl 
transcripts,  upstream  of  the  presumptive 
Lniciacion  codon.  How  this  alternative  splic- 
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ing  is  coordinated  with  alternative  splicing 
farther  downstream,  and  whether  all  the 
splice  variants  produce  proteins  with  an 
identical  NH2-terminus,  are  questions  that 
remain  to  be  explored. 

We  also  probed  genomic  DNA  samples 
from  several  different  species  with  BRCAl 
sequences  devoid  of  the  zinc  finger  region. 
Low-Stringency  blots  revealed  strongly  hy¬ 
bridizing  fragments  in  tissues  from  humans, 
mice,  rats,  rabbits,  sheep,  and  pigs,  but  not 
chickens  (Fig.  5).  These  results  suggest  that 


BRCAl  is  conserved  in  mammals. 

Germline  BRCAl  mutations  in  17q- 
linked  kindreds.  Identification,  of  a  candi¬ 
date  gene  as  BRCAl  requires  a  demonstra¬ 
tion  of  potentially  disruptive  mutations  in 
that  gene  in  carrier  individuals  from  kin¬ 
dreds  that  segregate  17q-linked  susceptibil- 
iry  to  breast  and  ovarian  cancer.  Such  in¬ 
dividuals  must  contain  BRCAl  alleles  that 
differ  from  the  wild-type  sequence.  The  set 
of  DNA  samples  used  in  this  analysis  con¬ 
sisted  of  DNA  from  individuals  represent- 
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Fig,  1,  Schematic  map  of  human  chromosome 
1 7.  The  pertinent  region  containing  BRCAl  is  ex¬ 
panded  to  indicate  the  relative  positions  of  two 
previously  identified  genes,  CA125  (34)  and 
RNU2  (45).  Dl  7S855  is  located  within  BRCAl. 
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1800  GT^IVVVQPDAWIEDNGFHMGQMCEAPVVTREWtrt-DSVALYQ 

1860  SHY  ^  _ j _ _ I - 1 - I 

B 

^ptpt  rltKEPVSTK-CDHIFCKFCMLKLLNQKK  gpsqcplck 

C3HC4  motif  C — C - -C  C 

F„.  2.  P„di«  »<no 

reading  frame,  indicating  the  Thfi-11  bo  deletion  in  kindred  1901  Is  shown  by  an 

locations  of  germline  mutations  Jioldfacere^^^^  frameshift  in  kindred  1910  is 

asterisk;  the  nonsense  mutation  in  J®  .  2Qggf  t,y  “R”.  The  BRCAl  nucleotide 

shown  by  “  +c'’ ;  and  the  missense  mutatio  rnhor  i  ii  PCR  orimer  sequences  are  available 

sequence  is  deposited  inGenBank  with  acce^onmmberUU^PC^^^^^ 

via  anonymous  FTP  at  the  following  internet  , . .  qoq/^  7  2inc  finger  domain  with 

or  by  fax  at  the  following  number.  801  ^ 

three  other  zinc  finger  domains  that  ®  J  .  .  ,0  mice  (47).  RIN1  is  a  DNA  binding 

that  appears  to  be  a  negative  regulator  of  t  e  in  putative  transcription 

Ihe  positions  of  L  cystines  and  the  histidine  that  form  the  zinc  binding  pockets. 


ing  ei<^ht  different  BRCA 1  kindreds  ^able 
1)  The  lod  scores  (likelihood  ratios  ior 
linkage)  in  these  kindreds  range  from  949  I 
jQ  -0.44  for  a  set  of  markers  in  17ci2l.  Four 
of  the  families  have  convincing  lod  scores 
for  linkage,  and  four  ^ve  low 
negative  lod  scores.  The  latter  kindreds 
were  included  because  they  demonstrate 
haplotype  sharing  at  chromosome  17q2 1  tor 
at  least  three  affected  members.  Further¬ 
more,  all  kindreds  in  the  set  display  early 
onset  breast  cancer,  and  four  of  the  kindreds 
include  at  least  one  ca.^  of 
both  hallmarks  of  BRCAl  kindreds.  Kin¬ 
dred  2082  has  nearly  equal  incidence  ot 
breast  and  ovarian  cancer,  an  unusual  oc¬ 
currence  given  the  relative  rarity  of  ovarian 
cancer  in  the  population  (17).  AH  but  two 
of  the  kindreds  were  ascertained  in  Utah. 
Kindred  2035  is  from  the  midwesterrr  Unit¬ 
ed  States.  Kindred  2099  is  an  A  ncan 
American  kindred  from  the  southern  Unit¬ 
ed  States;  all  other  kindreds  are  Caucasian. 

In  the  initial  screen  for  predisposing  tnu- 
tations  in  BRCAl ,  DMA  from  one  individ¬ 
ual  carrying  the  predisposing  hapbwe 
from  each  kindred  was  tested.  The  21  cod¬ 
ing  exons  and  associated  splice  J^octions 
were  amplified  from  either  genomic  DNA 
samples  or  cDNA  prepared  from  lympho¬ 
cyte  mRNA  (26).  When  the  amplified 
DNA  sequences  were  compared  to  the  ivild- 
type  sequence,  four  of  the  eight  in  re 
samples  were  found  to  contain  sequence 
variants  (Table  2).  All  four  sequence  vari¬ 
ants  are  heterozygous,  and  each 
only  one  of  the  kindreds.  Kindred  1901 
contains  an  11-base  pair  (bp)  deletion  m 
exon  2  (Cys24  frameshift  to  36  Stop).  Kin 
dred  2082  contains  a  nonsense  mutatioir  in 

coding  exon  11  (Glnl313  to  Stop)  (Fig. 


6A),  Kindred  1910  contains  a 
otide  insertion  in  coding  exon  20  (Gbl756 
frameshift  to  1829  Stop)  (Fig.  6B),  and 
kindred  2099  contains  a  missense  mutanon 
in  coding  exon  21  (Metl775Arg).  e 
frameshift  and  nonsense  mutations  hke- 

ly  to  disrupt  the  function  of  the  BRCAl 

proteins.  The  protein  encoded  by  the  inser- 
don  allele  in  kindred  1910  wou  d  contain 
an  altered  sequence  beginning 
acids  residues  from  the  wild-type  COTH- 
terminus.  The  effect  of  the  11-bp  deletion 
in  kindred  1901  would  be  even  more  dra¬ 
matic  because  it  occurs  at  the 
codon.  This  deletion  removes  the  last  1 1  bp 
of  exon  2  and  begins  at  the  first  cystine  of 
the  zinc  finger  motif,  hereby 
zinc  finger  domain.  The  mutant  allele  in 
kindred  2082  would  encode  a  protein  miss¬ 
ing  548  residues  from  the  COOH-terminus. 


The  missense  mutation  observed  in  kindred 
2099  is  potentially  disruptive  as  -sti- 

tutes  a  large,  charged  amino  acid  (Arg  lor 
a  small,  hydrophobic  amino  acid  (Met). 
Five  common  polymorpltisms  were  also 
identified  in  the  BRCAl  coding  sequence 

^  The  individual  studied  in  kindred  2035 
i<;  likely  to  carry  a  regulatory  mutation  in 
BRCAl.  In  her  cDNA,  two  polymorphic 
sites  (PMl  and  PM7)  appeared  homozy¬ 
gous,  whereas  her  genomic  DNA  reveabd 
heterozygosity  at  these  positions  (Fig.  6C). 
One  possible  explanation  for  this  observa¬ 
tion  is  that  mRNA  from  her  mutant 
BRCAl  allele  is  absent  because  of  a  muty 
tion  that  affects  RNA  Production  or  stabil¬ 
ity  We  explored  this  possibility  further  by 
examining  three  additional  Pr^^^vyorphic 

sites  (PM6,  PM7,  and  PM2)  in  the  BRCAl 


(cin\  Rr  cancer;  Br  <50,  br©ast  cancer 

Table  1.  Kindred  descriptions  and  associated  lod  scores  (50).  Br.  breast 

diagnosed  under  age  50;  Ov,  ovarian  cancer. _ _ _ _ _ _ 


Cases  (n) 


Kindred 


Sporadic  Lod 

cases t  score 

(n) 


Markers 


Mr  _ - _ _ _ 

_ _ — - - ^  r  QAQ  D17S1327 

s  s  s  1-  ■  0  “'Sir 

2.25  D17S1327 

2035  10  8  1,  Q  1.50  017S855 

1901  18  ^  n  0  0.55  D17S579 

^5  4  3  0  0  DJ7S579and 

®10  540  0  D17S250t 

_0.20  D17S250 

1911  8  5  0  1  D17S250 

calculate  multipoint  lod  scores. 


mm 


Fig.  3.  Diagram  of  BRCAl  BRCA1 

mRNA,  showing  the  ioca-  _ 

lions  of  introns  and  the  vari¬ 
ants  of  BRCA1  mRNA  pro-  P4 

duced  by  alternative  splic-  Breast  cDNA 

ing.  The  top  cDNA  (BRCAl) 

is  the  composite  used  to  ^ 
generate  the  protein  se-  TTS 

quence  in  Fig.  2.  Intron  loca-  TE2 

lions  are  shown  by  filled  tri-  q21  - 1  1 1 

angles,  and  the  exons  are  ^  — IJI  _  _ ^ 

numbered  below  the  com- 

posite  cDNA.  Alternative  TY4  i  _ ^ 

mRNAs  identified  as  cDNA  P103  Hy 

clones  or  in  hybrid-selection  ^ 

experiments  are  shown  be-  t> - — o 

low  the  composite.  The  start 
codon  (ATG)  and  stop  codon 

(TGA)  are  indicated.  The  zinc  89  |  | 

finger  region  is  denoted  noted  indicate  the  absence  of  internal 

by  a  double  line.  ‘•V  lines  (indicated  with  a 

exons.  All  exons  are  drawn  ®  3  the  position  of  a  single 

dotted  line).  Upward-pointing  un  i  □  Ld  1 4  in  some  cDNAs.  Leftward- 

codon  (CAG)  found  at  the  start  o  ex  ^  ggent  partial  exons  in  some 

romfra  srs  "O'”  » i-'”""* 


TY3  and  TE2  are  5'  RACE  breasTcD^^^^  ‘s  a  hybrid- 

and  B9  are  cDNA  clony  from  a  non  al  b  ^ 

. 
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coding  region,  which  are  separated  by  as 
^^ch  as  3  5  kb  in  the  BRCAl  transcript.  In 

riS»h«eh„g»o™cDNAapp»jed 
heterozygous  for  a  polymorphism,  her 
cDNAippeared  homozygous.  In  individuals 
from  othlr  kindreds  and  in  nonhaplotype 
carriers  in  kindred  2035,  these  PoWmorphm 
sites  appeared  heterozygous  in  cDNA,  im 
plying  that  amplification  frorn  cWA  was 
biased  in  favor  of  one  allele.  This  anal- 
Ss  Indicates  that  a  BRCAl  mutation  in 
kindred  2035  either  prevents  transcription 
or  causes  instability  or  aberrant  splicing  of 
j  rbe  BRCAl  transcript. 

Cosegregation  of  BRCAl  mutations 
with  BRCAl  haplotypes  and  population 
Trequency  analysis.  In  addition  to  potential 
dismption  of  protein  function,  a  sequence 
variant  must  meet  two  other  criteria  to 
qualify  as  a  candidate  predisposing  mut  - 
tion:  It  must  be  present  members  ot  the 
kindred  who  carry  the  predisposing  haplo- 
and  absent  from  other  members  of  he 
kILred,  and  it  must  be  rate  in  the  general 

^  To 't^'t  for  cosegtegation  of  mutations 
with  the  corresponding  BRCAl  ' 

.  1910.  2082,  2099, 


mduding  boih  amm  '  h 

the  predisposing  haplotype  (Fig.  6).  In  each 
kindred,  the  corresponding  mutant  allele 
;Ts  detected  only  in  individuals  carrying 
the  BRCAl -associated  haplotype.  In 
case  of  the  potential  regulatory  mutation  in 
kindred  2035,  cDNA  and  genomic  DNA 
from  carriers  in  the  kindred  were  compared 
for  heterozygosity  at  Polymorphic  sites  n 

every  instance,  the  extinguished  allele  m 

the  cDNA  sample  was  shown  ^ 
chromosome  that  carries  the  BRCA  p 

‘^‘'?rScilIde'the  possibility  that  the  mu- 
rations  were  simply  ^  Pf'S' 

phisms  in  the  P°P^l®“°^; "  S 
specific  oligonucleotides  (ASOs)  fot  eac 
mutation  to  screen  a  set  of  contro 
samples  (27).  The  actual  mutation  in  kin- 

Td  203Was  not  been  identih^.  so  we 

could  not  determine  its  frequency  m  the 
general  population.  Gene  frequency  est^ 
mates  in  Caucasians  were  based  on  ra^om 
samples  from  the  Utah  population.  Gene 
freqLncy  estimates  in  African  Americans 


i  ”  -  ^  ’ll  o. 

X  2  5  o  cr  cn  a. 


9  c  = 
^  o  fl 

I  S  ^ 


Fig  4.  Tissue  expression  pattern 
wL  were  obtained  from  Clontech  (Palo  Alto,  CA) 
and  contain  RNA  from  the  indicated  tissues.  Hy 
SiX  tonailons  «.re 

cDNA  fragment  corresponding  to  oocleotide 
3575  to  3874.  Note  that  these  tissues  are  hete 
Sneous  and  the  percentage  o«  mievant  epilhe^ 
lial  cells  in  breast  and  ovary  can  be  variable.  Siz 
standards  are  in  kiiobases. 


Pin  5  Blot  showing  hybridization  of  a  BRCAl 
species.  DNA  was  digested  with  Eco  Rl,  sub^t 

Size  standards  are  in  kiiobases. 


were  based  on  39  samples  used  in  linkage 
srudies  and  on  samples  from  20  African 
A„enc».  from  Unh  (28).  N“«  »/  'I;' 

four  potential  predisposing  mutation  tested 
was  found  in  the  appropriate  control  p  p 
Uton,  indlcacmg 
gonoml  popolatioo  (Table 

important  requirements  for  BRCAl  suscep 
tiUlity  alleles  are  fulfilled  by  the  candidate 

predisposing  mutations:  cosegregation  of 
L  mutant  allele  with  diseases  and  ab- 
enc^of  the  mutant  allele  in  controls 

Indicating  a  low  frequency  in  the  general 

Phenotypic  expression  of  BRCAl  inu- 

tations.  The  effect  of  the 

BRCAl  protein  correlates  with  differences 

"  the  oLerved 

the  BRCAl  kindreds.  Most  BRCAl  k 
dreds  have  a  moderately  increased  ovarian 
cancer  risk,  and  a  smaller  subset  have  a  h|J 
risk  of  ovarian  cancer  comparable  to  that 
for  breast  cancer  (3).  Four  of  the  five  kin¬ 
dreds  in  which  BRCAl  mutations  were  de¬ 
tected  fall  into  the  former  category  and  the 
Sth  (kindred  2082)  ^lls ,  into  the  group 
with  high  ovarian  cancer  risk  The  BRCA 
Lnsenie  mutation  found  in  kmd^d  gg 
L  an  interesting  phenotype.  Kmdred  2082 
has  a  high  incidence  of  ovarian  cancer,  and 

older  than  that  in  the  other  ^mdreds  (  7)^ 
This  difference  in  age  of  onset  cou 
to  an  ascertainment  bias  in  the  smaller 
more  highly  penetrant  families,  or  could 

reflect  tissue-specific  differenws  in  the  be¬ 
havior  of  BRCAl  mutations,  "rh^f 
kindreds  that  segregate  known  BRCAl  rnu 
rations  have,  on  average,  1  ovarian  can 
for  every  10  cases  of  breast  cancer,  but  have 
a  iS  proportion  of  breast  c^cer  cases 
diagnosed  at  an  early  age  (late  20s  “t  ear  y 
A7  30s).  Kindred  1910,  which  has  a  I'hP  '^r 
,us  tion  mutation,  is 

ct-  of  the  four  affected  individuals  had  bilateral 
>ss  last  cancer,  and  in  each  case  the  se  ond 

ich  tumor  was  diagnosed  within  a  year  of  the 

3  first  occurrence.  Kindred  2035.  ^hich  g 

regates  the  potential 

f,!  mutation,  might  also  be  expected  to  have 


1  mna  in  BRCAt  NA  indicates  not  applicable,  as  the  regulatory  mutation  is 
ToKio  9  PredisDOsing  mut3tions  in  fcino  ■ 

ilX  sS  S  MS  _ _ _ 


Mutation 


Kindred 


Codon 

Nucleotide 

Coding 

effect 

change 

24 

1313 

1756 

-11  bp 

C-^T 

Extra  C 

Franieshift  or  splice 
Gln-^Stop 
Frameshift 
Met->Arg 

1775 

T— 

? 

Loss  of  transcript 

NA 

Frequency 
in  control 
chromosomes 


Tahia  3  Neutral  polymorphisms  in  BRCAl.  For 

at  methdicated  polymorphic  site  is  shown  (A,  C, 
G,  orT).  _ ' 

Frequency  in  control 
Codon  Base  chromosomes 


PM1  317 
PM6  878 
PM7  1190 
PM2  1443 
Plv13  1619 


152  0  10  0 

0  55  0  100 

109  0  53  0 

0  115  0  58 

116  0  52  0 
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dramatic  phenotype.  Eighty  percent  o 
breast  cancer  cases  in  this  kindred  occur 
under  age  50.  This  figure  is  as  high  as  any  in 
the  set,  suggesting  that  this  BRCAl  mutant 
allele  has  a  high  penetrance  (Table  1).  Kin¬ 
dred  1901  displays  a  phenotypic  pattern 
similar  to  that  of  kindred  2035.  It  is  likely 
that  the  ll-bp  deletion  beginning  at  codon 
24  carried  in  kindred  1901  results  in  a  loss 
of  gene  function  similar  to  the  effect  of  the 
regulatory  mutation  in  kindred  2035. 

Although  the  mutations  described  in 
this  research  article  are  clearly  deleterious, 
causing  breast  cancer  in  women  at  very 
young  ages,  each  of  the  four  kindreds  with 
mutations  includes  at  least  one  woman  who 
carried  the  mutation  but  lived  until  age  80 
without  developing  a  malignancy.  It  will  be 
of  utmost  importance  in  future  studies  to 
identify  other  genetic  factors  or  environ¬ 
mental  factors  that  may  ameliorate  the  ef¬ 
fects  of  BRCAi  mutations.  In  addition,  in 
three  of  the  eight  putative  BRCAl -linked 
kindreds,  potential  predisposing  mutations 
were  not  found.  All  of  these  kindreds  have 
lod  scores  for  BRCA 1  -linked  markers  of  less 
than  0.55  and  thus  may  not  truly  segregate 
BRCAl -predisposing  alleles.  Alternatively, 
the  mutations  in  these  three  kindreds  may 
lie  in  noncoding  regions  of  BRCAl  and 
therefore  have  escaped  detection. 

The  role  of  BRCAl  in  cancer.  Most 


mutant  tumor  suppressor  genes  identified  to 
date  encode  proteins  that  are  absent,  non¬ 
functional,  or  reduced  in  function.  The  ma¬ 
jority  of  TP53  mutations  are  missense;  some 
of  these  have  been  shown  to  produce  ab¬ 
normal  p53  molecules  that  interfere  with 
the  function  of  the  wild-type  product  (30, 
31).  A  similar  dominant-negative  mecha¬ 
nism  of  action  has  been  proposed  for  some 
adenomatous  polyposis  coli  (ARC)  alleles 
that  produce  truncated  molecules  (32)  and 
for  point  mutations  in  the  Wilms  tumor 
gene  (WTl),  which  alter  DNA  binding  of 
the  WTl  protein  (33).  The  nature  of  three 
mutations  observed  in  the  BRCAi  coding 
sequence  is  consistent  with  production  of 
either  dominant-negative  proteins  or  non¬ 
functional  proteins.  All  three  mutations  are 
located  in  the  COOH-tcrminal  half  of  the 
protein.  The  regulatory  mutation  inferred 
in  kindred  2035  cannot  be  dominant-nega¬ 
tive;  rather,  this  mutation  likely  causes  re¬ 
duction  or  complete  loss  of  BRCAi  expres¬ 
sion  from  the  affected  allele.  Similarly,  the 
ll-bp  deletion  in  kindred  1901  likely  pro¬ 
duces  a  nonfunctional  product. 

The  BRCAl  protein  contains  a  C3HC4 
zinc  finger  domain  similar  to  domains  found 
in  numerous  nucleic  acid  binding  proteins. 
The  first  180  amino  acids  of  BRCAl  con¬ 
tain  five  more  basic  residues  than  acidic 
residues.  In  contrast,  the  remainder  of  the 


molecule  is  very  acidic,  with  a  net  excess  of 
70  acidic  residues.  The  excess  negative 
charge  is  particularly  concentrated  near  the 
COOH-terminus.  Thus,  one  possibility  is 
that  BRCAl  encodes  a  transcription  factor 
with  an  NH^-terminal  DNA  binding  do¬ 
main  and  a  COOH-terminal  “acidic  blob” 
domain  with  transactivational  activity.  In¬ 
terestingly,  the  product  of  another  familial 
tumor  suppressor  gene,  WTl,  also  contains 
zinc  finger  domains  (34),  and  these  are 
altered  by  many  cancer-predisposing  muta¬ 
tions  in  the  gene  (33-35).  The  WTl  gene 
encodes  a  transcription  factor,  and  altema-  | 
tive  splicing  of  exons  that  encode  parts  of  I 
the  zinc  finger  domains  alters  the  DNA  ^ 
binding  properties  of  WTl  (36).  Some  a  I-  | 

ternatively  spliced  forms  of  WTl  mRNA 
generate  WTl  proteins  that  act  as  tran-  | 
scriptional  repressors  (37).  Differential 
splicing  of  BRCAl  may  alter  the  zinc  finger 
motif  (Fig.  3),  raising  the  possibility  that  a 
regulatory  mechanism  similar  to  that  occur¬ 
ring  in  WTl  may  apply  to  BRCAl . 

The  identification  of  a  gene  that  (i)  falls 
within  the  interval  known  from  genetic 
studies  to  include  BRCAl  and  (ii)  contains 
frameshift,  nonsense,  and  regulatory  muta- 
;  tions  that  cosegregate  with  predisposing 
BRCAl  alleles  strongly  indicates  that  this 

■  gene  is  BRCAl.  The  observadon  of  poten¬ 
tial  predisposing  mutations  in  individuals 


Fig.  6.  Mutation  and  cosegregation 
analysis  in  BRCAI  kindreds.  Carrier  in¬ 
dividuals  are  represented  as  filled  cir¬ 
cles  and  squares  in  the  pedigree  dia¬ 
grams,  (A)  Nonsense  mutation  in  kin¬ 
dred  2082.  P  indicates  the  person  orig¬ 
inally  screened;  b  and  c  are  haplotype 
carriers;  a,  d,  e,  f,  and  g  do  not  carry  the 
BRCA  1  haplotype.  The  C  to  T  mutation 
results  in  a  stop  codon  and  creates  a 
site  for  the  restriction  enzyme  Avr  11. 

PCR  amplification  products  were  cut 
with  this  enzyme,  subjected  to  electro- 

phoresis  through  1 .0%  agarose  gels,  and  stained  with  ^ 

carriers  are  heterozygous  for  the  site  and,  therefore,  show  three  bands.  Th 
PCR  products  of  noncarriers  remain  uncut  by  Avr  II  and  therefore,  show  one 
band.  Size  standards  are  in  base  pairs.  (B)  Frameshift  mutation  in  l^ndred 
1910.  Sequencing  reactions  were  loaded  side  by  side  as  A,  ,  ■ 

T  reactions.  The  first  three  lanes  for  each  nucleotide  set  contain  sequence 
[adders  from  noncarriers.  Lanes  1  to  3  contain  A  'feiers  from  earner 
individuals.  Lane  4  contains  the  A  ladder  from  a  kindred  rnember  who  does 
not  carry  the  BRCAl  mutation.  The  frameshift  resulting  from  the  cytosne 
[nserton  is  apparent  in  lanes  1  through  3.  (The  diamond  shape  in  the 
pedigree  diagram  is  used  to  protect  the  confidentiality  of  the  transmitting 
Snt )  (C)  Inferred  regulatory  mutation  in  kindred  2035.  ASO  analysis  of 
Slotype  cairiers  and  noncar, ieis  used  two  different  polymorphisms. 
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PM1  and  PM7  (Table  3).  Samples  were  examined 

germ  line  and  compared  to  the  heterozygosity  of  lymphoc^e  mRNA.  T^e 
top  two  rows  of  each  panel  contain  PCR  products  amplified  from  genOT^ 
DNA,  and  the  bottom  two  rows  contain  P^R  products  arnpl  Jed  fro 
lymphocyte  cDNA.  "A"  and  “G”  are  the  two  alleles  detected  by  the  ASO^^^ 
The  dark  spots  indicate  that  a  particular  allele  is  present  in  the  sarnple  The 
first  three  lanes  of  the  lower  panel  represent  the  three  genotypes  fro 
unrelated  individuals.  I  heio  is  no  cl  'NA  lor  sample  4  in  either  panel. 
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lose  .early-onset  breast  or  ovarian  cancer 
as  not  ascertained  by  himily  history  sup- 
arts  rhe  view  that  many  early-onset  cases 
e  due  to  mutations  at  the  BRCAI  locus 
18).  the  role  of  BRCAI  in  cancer  progres- 
pn  may  now  be  addressed  with  molecular 
decision.  The  large  size  and  fragmented 
ature  of  the  ccxling  sequence  will  make 
khaustive  searches  for  new  mutations  chal- 
bnging.  Nevertheless,  rhe  laercentage  of  total 
ireast  and  ovarian  cancer  caused  by  mutant 
iRCAl  alleles  will  stxin  be  estimated,  and 
Ldividual  mutation  frequencies  and  pen- 
[tninces  may  be  established.  This  m  turn  may 
^rmit  accurate  genetic  screening  for  predis- 
kasition  to  a  common,  deadly  disease.  Al- 
hough  such  research  represents  an  advance 
n  medical  and  biological  knowledge,  it  also 
■aiscs  numerous  ethical  and  practical  issues, 
X)th  scientific  and  stKial,  that  must  be  ad¬ 
dressed  by  the  medical  community. 

Note  added  in  l>roof:  Analysis  of  kindred 
1911  indicates  a  possible  linkage  to 
BKCA2,  suggesting  that  early  breast  cancer 
|in  this  kindred  is  not  due  to  a  mutation  of 
BRCAI  (51). 
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Summary 

The  breast-ovary  cancer-family  syndrome  is  a  dominant 
predisposition  to  cancer  of  the  breast  and  ovaries  which 
has  been  mapped  to  chromosome  region  17ql2-q21.  The 
majority,  but  not  all,  of  breast-ovary  cancer  families  show 
linkage  to  this  susceptibility  locus,  designated  BRCAl. 
We  report  here  the  results  of  a  linkage  analysis  of  145 
families  with  both  breast  and  ovarian  cancer.  These  fami¬ 
lies  contain  either  a  total  of  three  or  more  cases  of  early- 
onset  (before  age  60  years)  breast  cancer  or  ovarian  cancer. 
All  families  contained  at  least  one  case  of  ovarian  cancer. 
Overall,  an  estimated  76%  of  the  145  families  are  linked 
to  the  BRCAl  locus.  None  of  13  families  with  cases  of 
male  breast  cancer  appear  to  be  linked,  but  it  is  estimated 
that  92%  (95%  confidence  interval  76%-100%)  of  families 
with  no  male  breast  cancer  and  with  two  or  more  ovarian 
cancers  are  linked  to  BRCAl.  These  data  suggest  that  the 
breast-ovarian  cancer-family  syndrome  is  genetically  het¬ 
erogeneous.  However,  the  large  majority  of  families  with 
early-onset  breast  cancer  and  with  two  or  more  cases 
of  ovarian  cancer  are  likely  to  be  due  to  BRCAl  muta¬ 
tions. 
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Introduction 

In  1990,  a  locus  on  chromosome  17q,  BRCAl,  was  found 
to  be  linked  to  a  panel  of  families  with  early-onset  breast 
cancer  (Hall  et  al.  1990).  Shortly  thereafter,  this  locus  was 
shown  also  to  predispose  to  cancer  of  the  ovary  (Narod  et 
al.  1991).  Not  all  families  with  hereditary  breast  cancer  are 
linked  to  BRCAl  (Hall  et  al.  1990;  Sobol  et  al.  1992);  it  is 
estimated  that  fewer  than  half  of  site-specific  breast  cancer 
families  show  linkage  to  BRCAl  (Easton  et  al.  1993). 
However,  the  proportion  for  families  segregating  ovarian 
cancer  in  addition  to  early-onset  breast  cancer  is  much 
higher.  In  the  first  report  of  the  Breast  Cancer  Linkage 
Consortium  (BCLC)  it  was  estimated  that  100%  of  the  57 
breast-ovarian  cancer  families  studied  were  linked  to 
BRCAl,  with  a  lower  95%  confidence  interval  (Cl)  of  79% 
(Easton  et  al.  1993).  Since  this  report,  consortium  mem¬ 
bers  have  studied  many  additional  breast-ovarian  cancer 
families  for  linkage  to  chromosome  17q  markers,  and  the 
possibility  that  a  number  of  these  families  may  not  be 
linked  to  chromosome  17q  has  been  raised.  We  report 
here  the  linkage  results  on  145  breast-ovarian  cancer  fam¬ 
ilies  collected  by  the  BCLC. 

Families,  Material,  and  Methods 

Families 

The  145  families  included  in  this  study  were  all  those 
collected  by  the  BCLC,  and  each  contains  at  least  three 
cases  of  either  early-onset  (before  age  60  years)  breast  can¬ 
cer  or  ovarian  cancer;  each  family  contains  at  least  one 
case  of  ovarian  cancer.  All  but  6  of  the  57  breast-ovarian 
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cancer  families  that  were  included  in  the  first  report  are 
included  in  this  analysis,  but  several  additional  markers 
have  now  been  typed,  and  some  families  have  been  ex¬ 
tended  to  include  newly  identified  cases.  Since  submission 
of  data  for  the  first  consortium  report,  the  other  94  fami¬ 
lies  have  been  investigated  for  linkage.  Nine  families  with 
site-specific  ovarian  cancer,  as  well  as  those  families  with 
male  breast  cancer,  have  also  been  reported  elsewhere 
(Steichen-Gersdorf  et  al.  1994;  Stratton  et  al.  1994). 

Genetic  Markers 

A  total  of  11  markers  were  used  to  genotype  the  145 
families;  however,  the  markers  used  varied  from  group  to 
group.  From  centromere  to  telomere  the  markers  are 
D17S250,  THRAl,  D17S800,  D17S776,  D17S855,  ED2, 
D17S858,  D17S579,  D17S293,  D17S588,  and  NMEl.  The 
primer  sequences  and  map  position  of  D17S250,  THRAl, 
D17S579,  D17S588,  and  D17S93  have  been  presented  by 
Easton  et  al.  (1993);  D17S855  and  D17858  by  Anderson  et 
al.  (1993);  and  D17S776  and  ED2  by  Goldgar  et  al.  (1994). 
NMEl  is  an  RFLP  from  the  nm23  gene  region  (Varesco  et 
al.  1992). 

Statistical  Methods 

LOD  scores  for  linkage  between  BRCAl  and  flanking 
markers  were  calculated  using  LINKAGE  (Lathrop  et  al. 
1984)  and  assuming  (1)  the  disease-specific  risks,  in  gene 
carriers,  derived  by  Easton  et  al.  (1993)  and  (2)  a  .0033 
frequency  of  mutant  BRCAl  alleles.  Under  this  model, 
BRCAl  confers  a  breast  cancer  cumulative  risk  that  is  49% 
by  age  50  years  and  71%  by  age  70  years  and  confers  an 
ovarian  cancer  cumulative  risk  that  is  16%  by  age  50  years 
and  42%  by  age  70  years.  To  implement  this  model  in 
LINKAGE,  different  liability  classes  were  used  for  un¬ 
affected  females  and  for  females  affected  with  either  breast 
cancer  or  ovarian  cancer.  A  total  of  21  liability  classes  were 
constructed,  depending  on  either  the  age  at  cancer  diagno¬ 
sis  or,  in  unaffected  women,  the  age  at  last  observation. 
The  penetrance  values  in  each  liability  class  for  unaffected 
females  were  taken  from  the  estimated  cumulative  inci¬ 
dence  figures,  based  on  population  rates  for  England  and 
Wales.  For  affected  individuals  these  are  given  by  the  age- 
specific  incidence  densities  at  the  midpoint  of  the  age 
group.  These  values  have  been  estimated  from  the  consor¬ 
tium  data  set  by  maximizing  the  likelihood  of  linkage  of 
BRCAl  for  the  breast  and  ovarian  cancer  data  over  all  pos¬ 
sible  penetrance  functions.  The  penetrance  values  used  are 
presented  in  table  1  .These  values  differ  slightly  from  those 
in  the  model  used  in  the  previous  BCLC  analysis,  in  that 
the  new  model  assigns  a  lower  probability  of  an  older  case 
of  breast  cancer  being  a  carrier  of  a  BRCAl  mutation  and 
provides  age-specific  genotype  probabilities  for  ovarian 
cancer  cases. 

Individuals  with  both  breast  and  ovarian  primary  can¬ 
cers  were  treated  as  if  affected  with  breast  cancer  before 


Table  I 


Genetic  Model  Used  in  LINKAGE  Analysis 


Age  Group 
(years) 

Penetrance  OF  Genotype^ 

dd 

Dd 

DD 

Unaffected  females: 

<30 . 

.00025 

.0237 

.0237 

30-39  . 

.0024 

.1357 

.1357 

40-49  . 

.0109 

.4222 

.4222 

50-59  . 

.0266 

.6732 

.6732 

60-69  . 

.0468 

.7946 

.7946 

70-79  . 

.0700 

.8586 

.8586 

>80 . 

Breast  cancer  cases:  . 

.0960 

.9000 

.9000 

<30 . 

.000034 

.00355 

.00355 

30-39  . 

.000366 

.01387 

.01387 

40-49  . 

.00113 

.02768 

.02768 

50-59  . 

.00148 

.01119 

.01119 

60-69  . 

.00180 

.00434 

,00434 

70-79  . 

.00200 

.00327 

.00327 

>80 . 

.00239 

.00231 

.00231 

Ovarian  cancer  cases: 

<30 . 

.000016 

.00113 

.00113 

30-39  . 

.000048 

.00304 

.00304 

40-49  . 

.000178 

.00755 

.00755 

50-59  . 

.000347 

.00613 

.00613 

60-69  . 

.000411 

.00386 

.00386 

70-79  . 

.000430 

.00163 

.00163 

>80 . 

.000384 

.00116 

.00116 

^  “D”  refers  to  the  disease  allele  conferring  susceptibility  to  breast  and 
ovarian  cancers,  and  “d”  is  the  normal  allele.  Males  were  assigned  to  the 
first  liability  class. 


age  30  years,  in  order  to  maximize  their  probability  of  be¬ 
ing  gene  carriers.  Males  with  breast  cancer  were  assigned 
to  the  liability  class  of  female  cases  affected  before  age  30 
years,  because  of  the  rarity  of  male  breast  cancer. 

In  order  to  compute  the  multipoint  LOD  scores,  the 
male  map  distances  between  markers  were  assumed  to  be 
1.3  cM  from  D17S250  to  THRAl  and  2.7  cM  from 
THRAl  to  D17S579.  A  2:1  female:male  genetic  distance 
ratio  was  assumed.  BRCAl  was  placed  midway  between 
THRAl  and  D17S579  (Bowcock  et  al.  1993;  Chamberlain 
et  al.  1993;  Simard  et  al.  1993).  Recently,  BRCAl  has  been 
assigned  to  a  position  telomeric  to  the  markers  D17S800 
(Kelsell  et  al.  1993),  D17S776  (Goldgar  et  al.  1994),  and 
D17S702  (Smith  et  al.  1994).  The  relative  position  of 
BRCAl  with  respect  to  D17S855,  ED2,  or  D17S858  has 
not  been  determined.  In  most  families  a  multipoint  LOD 
score  was  computed  using  flanking  markers  D17S250  (We¬ 
ber  et  al.  1990)  and  D17S579  (Hall  et  al.  1992).  For  a  small 
number  of  families,  data  were  unavailable  for  these  two 
markers,  and  data  for  other  markers  in  the  region,  in  par¬ 
ticular  THRAl,  were  used. 

For  ease  of  computation,  and  because  direct  compari¬ 
son  of  the  frequencies  of  specific  alleles  across  study 


256 


Afn.].  Hum.  Genet.  56:254-264,  1995 


Table  2 

Linkage  Results  for  Breast-Ovarian  Cancer  Families 


LOD 


Assuming 

Homogeneity 

Assuming 

Heterogeneity 

a  (95%  Confidence  Interval) 

All  families  (n  =  145)  . 

Families  with  one  or  more  cases  of  male  breast 

48.29 

57.07 

.76  (.62-.87) 

cancer (n  =  13)  . 

Families  with  no  case  of  male  breast  cancer 

-11.68 

0 

0  (.00-.22) 

(w=  132)  . 

Families  with  no  male  case  and  one  ovarian 

59.97 

62.37 

.88  (.74-.97) 

cancer  (?/  =  51) . 

Families  with  no  male  case  and  two  or  more 

11.74 

13.84 

.81  {.58-.97) 

ovarian  cancers  {n  —  81) . 

48.24 

48.72 

.92  (.76-1.0) 

groups  was  not  possible,  each  of  the  highly  polymorphic 
CA-repeat  markers  was  coded  as  a  system  of  equally  fre¬ 
quent  alleles  (the  exact  number  was  chosen  to  correspond 
to  the  observed  polymorphism  of  the  marker).  Haplotypes 
were  constructed  from  the  observed  segregation  of 
multiple  closely  linked  markers  in  the  extended  families, 
assuming  a  minimum  number  of  genetic  recombinants. 

Evidence  for  heterogeneity  was  evaluated  using  the  ad¬ 
mixture  test,  in  which  a  proportion  of  families  are  assumed 
to  be  linked  to  BRCAl  and  the  remainder  are  assumed  to 
be  linked  to  other  loci  (Smith  1961).  The  HOMOG  pro¬ 
gram  (Ott  1985)  was  used  to  compute  LOD  scores,  with 
heterogeneity  being  assumed. 

Results 

When  the  145  families  are  considered  together,  there 
is  clear  evidence  for  heterogeneity  of  the  breast-ovarian 
cancer  syndrome  (table  2).  Overall,  it  was  estimated  that 
76%  of  the  145  families  studied  were  linked  to  BRCAl 
(95%  Cl  of  62%-87%).  Much  of  the  evidence  for  hetero¬ 
geneity  comes  from  families  with  one  or  more  cases  of 
male  breast  cancer;  it  is  estimated  that  none  of  13  families 
of  this  type  are  linked  to  BRCAl.  Four  of  the  seven  fami¬ 
lies  that  produced  multipoint  LOD  scores  <—1.0  contain 
cases  of  male  breast  cancer.  An  example  of  such  a  family  is 
presented  in  figure  1. 

When  the  families  without  male  cases  are  considered 
separately,  the  evidence  for  heterogeneity  is  much  weaker. 
It  is  estimated  that  88%  of  families  without  male  cases  are 
linked  to  BRCAl,  with  95%  Cl  of  74%-97%.  Several  of 
the  families  that  produce  negative  LOD  scores  contain 
only  a  single  case  of  ovarian  cancer.  Among  the  81  breast- 
ovarian  cancer  families  with  two  or  more  cases  of  ovarian 
cancer,  the  LOD  score  calculated  under  homogeneity  was 
not  significantly  less  than  that  calculated  under  heteroge¬ 
neity,  and  the  possibility  that  all  of  the  families  were  linked 
to  BRCAl  could  not  be  rejected.  It  was  also  estimated  that 


100%  of  the  11  families  with  predominantly  ovarian  can¬ 
cer  (i.e.,  no  male  cases  and  no  case  of  breast  cancer  under 
age  60  years)  were  linked  to  BRCAl  (LOD  score  of  2.06). 

Only  three  families  gave  multipoint  LOD  scores  <—2.0 
and  thereby  satisfy  the  conventional  criterion  of  an  un¬ 
linked  family  (table  3);  two  of  these  families  contained  at 
least  one  case  of  male  breast  cancer  and  have  been  the 
subject  of  a  previous  report  (Stratton  et  al.  1994).  Four 
additional  breast-ovarian  cancer  families  give  LOD  scores 
in  the  range  of  —1.0  to  —2.0.  Two  of  these  families  also 
contained  a  male  case.  The  pedigrees  and  haplotypes  of 
the  three  families  without  male  cases  are  given  in  figure  2, 
and  these  families  are  discussed  individually  below. 

Leiden  9 

This  family  contains  nine  cases  of  breast  cancer  and  a 
single  case  of  ovarian  cancer.  Three  of  the  breast  cancers 
occurred  before  the  age  of  40  years.  Individuals  52  and  44 
are  sisters  and  were  both  affected  with  breast  cancer  at 
the  age  of  34  years.  They  have  inherited  different  paternal 
chromosome  17  haplotypes,  although  it  appears  that  the 
cancer  susceptibility  gene  in  this  sibship  was  inherited 
through  the  father.  Individual  12,  also  diagnosed  with 
breast  cancer  at  age  34  years,  shares  no  haplotype  with 
either  of  her  affected  cousins. 

Iceland  4 

In  addition  to  eight  breast  cancers  and  one  ovarian  can¬ 
cer,  this  unusual  pedigree  contains  four  cases  of  stomach 
cancer  and  five  cases  of  prostate  cancer.  Two  of  the  breast 
cancer  cases  are  bilateral.  There  is  no  common  chromo¬ 
some  17  haplotype  segregating  among  the  affected  mem¬ 
bers.  Individual  9,  who  developed  breast  cancer  at  age  60 
years,  later  developed  cancer  of  the  kidney  and  cancer  of 
the  ureter.  Cancer  of  the  stomach  and  cancer  of  the  uri¬ 
nary  system  are  seen  in  excess  in  the  nonpolyposis  colon 
cancer  syndromes. 
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Figure  I  Pedigree  of  family  with  male  breast  cancer.  Blackened  circles  indicate  affected  women;  and  a  diagonal  slash  indicates  an  individual 
who  is  deceased.  Individual  identification  numbers  appear  directly  below  the  symbols.  Br  =  breast  cancer;  Ov  =  ovarian  cancer;  and  Sto  -  stomach 
cancer.  The  numbers  following  these  abbreviations  indicate  age  at  diagnosis.  The  numbers  arranged  vertically  below  the  individual  symbols  indicate 
the  marker  alleles  arranged  into  haplotypes.  Haplotypes  in  brackets  are  inferred.  Marker  alleles  separated  by  a  comma  cannot  be  phased.  An  x  indicates 
position  of  recombination. 


/ARC  2850 

There  does  not  seem  to  be  a  consistent  haplotype  segre¬ 
gating  in  this  large  breast-ovarian  cancer  family,  although 
in  the  left-hand  branch  the  (2, 5, 1,3)  haplotype  appears  to 
segregate  with  the  trait.  This  haplotype  does  not  appear  in 
any  of  the  other  affected  women,  although  there  have  been 
many  opportunities  for  genetic  recombination  between 
the  different  branches  of  the  haplotype.  This  problem  may 
be  resolved  when  additional  markers  and  individuals  are 
tested. 

In  addition  to  these  few  families  that  are  clearly  un¬ 
linked  to  BRCAl,  there  were  30  families  with  weaker  evi¬ 
dence  against  linkage  (LOD  scores  of  —0.10  to  —0.99).  In 
most  of  these  families,  a  single  breast  cancer  patient  or 
ovarian  cancer  patient  has  a  haplotype  inconsistent  with 


linkage.  Some  of  these  families  could  be  due  to  other 
genes,  but  there  is  also  the  possibility  that  the  inconsistent 
case  could  be  sporadic.  An  example  of  a  family  of  this  type 
is  CRC  128  (fig.  3),  in  which  individual  301  was  diagnosed 
with  breast  cancer  at  age  35  years  but  does  not  share  a 
common  haplotype  with  her  four  affected  sisters,  all  of 
whom  have  inherited  the  same  marker  alleles,  from 
D17S579  to  NMEl,  from  their  affected  mother.  If  this 
family  is  actually  linked  to  BRCAl,  then  individual  301 
appears  to  be  a  sporadic  case  of  breast  cancer.  Alterna¬ 
tively,  a  double  recombinant  may  have  occurred  between 
D17S250  and  D17S579  (a  distance  of  8  cM  in  females) 
(LOD  score  of  —0.91). 

The  cancer  mutation  in  the  left-hand  branch  of  family 
lARC  2651  appears  to  be  linked  to  the  BRCAl  locus  in  all 


Table  3 

Characteristics  of  Seven  Breast-Ovarian  Cancer  Families  with  Multipoint  LOD  Scores  <- 1 .00 


No.  of  Breast  Cancers  No.  of  Male  Breast  No.  of  Ovarian 


Family  before  60  years  Cancers  Cancers _ LOD  Score^ 


RUL9 .  7  0  1  -2.01 

RUL49 .  4  1  3  -1.11 

CRC  186 .  16  1  1  -2.61 

IRAC2850  .  15  0  2  -1.71 

UTAH  2044 .  7  1  4  -1.32 

UTAH  107 .  32  3  6  -2.45 

ICEL  80004  .  6  0  1  -1.89 


®  Derived  from  three-point  linkage  analyses  using  the  flanking  markers  D17S250  and  D17S579. 
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five  affected  women  (fig.  4).  However,  the  linked  chromo¬ 
some  was  inherited  from  the  grandfather,  individual  69. 
The  other  three  affected  women  in  the  family  do  not  share 
a  haplotype  with  the  left-hand  branch.  Individual  120  de¬ 
veloped  a  mucinous  ovarian  cancer  at  age  26  years.  Under 
the  current  model,  this  woman  would  be  predicted  to  be  a 
BRCAl  mutation  carrier.  However,  if  the  phenotype  of 
this  young  woman  is  coded  as  ambiguous,  then  the  LOD 
score  for  this  family  becomes  1.39.  lARC  1812  is  also  con¬ 
sistent  with  segregation  of  a  mutation  at  the  BRCAl  locus, 
with  the  exception  of  individual  21,  who  is  the  single 
woman  with  ovarian  cancer  in  the  family  (fig,  4).  She  de¬ 
veloped  a  mucinous  cystadenocarcinoma  of  the  ovary  at 
age  17  years  and  is  the  youngest  case  of  ovarian  cancer  in 
the  consortium  data  set. 

Discussion 

This  analysis  confirms  our  previous  results,  which  dem¬ 
onstrated  that  a  large  majority  of  breast-ovarian  cancer 
families  are  linked  to  BRCAl.  However,  in  contrast  to  the 
previous  analysis,  under  the  current  definition  of  the  breast 
ovarian  cancer  family  syndrome  there  is  now  clear  evi¬ 
dence  for  genetic  heterogeneity,  with  an  estimated  76%  of 
the  families  being  linked.  The  negative-linkage  evidence 
comes  both  from  families  that  also  contain  male  breast 
cancer  cases  and  from  families  with  a  single  case  of  ovarian 
cancer.  There  is  little,  if  any,  evidence  for  genetic  hetero¬ 
geneity  among  families  with  at  least  two  cases  of  ovarian 
cancer.  These  data  support  a  refined  clinical  definition  of 
the  breast-ovarian  cancer-family  syndrome — as  being 
those  families  with  both  two  or  more  cases  of  early-onset 
breast  cancer  and  two  or  more  cases  of  ovarian  cancer. 
Under  this  refined  definition,  it  can  be  expected  that  al¬ 
most  all  breast-ovarian  cancer  families  will  be  attributable 
to  BRCAl  mutations. 

It  is  important  to  estimate  accurately  the  extent  of  ge¬ 
netic  heterogeneity,  both  for  purposes  of  genetic  counsel¬ 
ing  and  to  expedite  positional  cloning.  Genetic  counseling 
using  BRCAl  markers  can  now  be  considered  if  it  can  be 
established  with  reasonable  certainty  that  a  family  segre¬ 
gates  a  BRCAl  mutation  (Biesecker  et  al.  1993;  King  et  al. 
1993;  Lynch  et  al.  1993;  Narod  1993).  If  there  is  significant 
genetic  heterogeneity  in  the  syndrome,  then  it  will  be  nec¬ 
essary  to  take  this  into  account  and  to  establish  the  prob¬ 
ability  that  a  given  family  is  linked,  before  risk  assessment 
is  considered.  In  contrast,  if  all  breast-ovarian  cancer  fam¬ 
ilies  are  linked  to  the  same  locus,  as  the  first  consortium 
paper  suggested,  then  counseling  could  be  pursued  in  any 
large  family  of  the  type  described  here  (i.e.,  when  chance 
clustering  is  not  a  realistic  alternative).  On  the  basis  of  our 
current  results,  we  feel  that  it  is  inappropriate  to  offer  ge¬ 
netic  counseling  to  breast-ovarian  cancer  families  unless 
the  possibility  of  genetic  heterogeneity  is  taken  into  ac¬ 
count.  It  is  also  reasonable  at  this  time  to  consider  all 


women  who  carry  BRCAl  mutations  to  be  at  increased 
risk  for  both  breast  cancer  and  ovarian  cancer. 

Genetic  linkage  is  currently  the  principal  tool  employed 
by  investigators  trying  to  identify  BRCAl.  This  is  both  be¬ 
cause  there  have  been  no  reports  of  a  chromosome  17  re¬ 
arrangement  segregating  in  a  breast  cancer  family  and  be¬ 
cause  the  potential  for  mapping  BRCAl  by  tumor  dele¬ 
tions  (i.e.,  by  loss  of  heterozygosity)  is  limited.  Several 
chromosome  17  loci  may  be  involved  in  somatic  mutations 
in  breast  cancers  and  ovarian  cancers  (Saito  et  al.  1993), 
and  the  pattern  of  chromosome  17  rearrangements  in 
these  tumor  types  is  complex  (Foulkes  et  al.  1991;  Cornells 
et  al.  1993).  Because  of  the  importance  of  genetic  mapping 
in  the  search  for  BRCAl,  each  informative  recombinant 
needs  to  be  scrutinized  both  in  terms  of  the  probability  of 
linkage  of  the  family  and  in  terms  of  the  number,  ages,  and 
sites  of  cancers  in  the  affected  women.  In  the  pedigrees 
reported  here,  there  appear  to  be  several  “sporadic”  cases 
among  families  that  otherwise  appear  to  be  linked  to 
BRCAl.  It  is  important  to  appreciate  that  crossovers  that 
appear  in  affected  women  from  presumably  linked  families 
are  currently  used  to  define  the  region  of  assignment  of 
BRCAl  (e.g.,  see  Simard  et  al.  1993;  Tonin  et  al.  1994). 
If  these  crossovers  were  actually  in  women  with  sporadic 
cancers,  or  if  the  susceptibility  gene  in  the  family  was  not 
linked  to  BRCAl,  then  false  inferences  about  the  location 
of  BRCAl  would  result.  Because  of  sporadic  cases,  and 
because  it  is  now  appears  unlikely  that  all  breast-ovarian 
cancer  families  are  linked  to  BRCAl,  the  evidence  pro¬ 
vided  by  any  single  crossover,  in  regard  to  the  mapping  of 
BRCAl,  is  inconclusive. 

A  recent  report  describes  a  germ-line  p53  mutation  that 
segregated  in  a  breast-ovarian  cancer  family  (Jolly  et  al. 
1994),  and  another  report  proposes  that  the  proportion  of 
these  families  with  germ-line  p53  mutations  may  be  sub¬ 
stantial  (Buller  et  al.  1993).  These  reports  suggest  that  p53 
testing  in  families  apparently  unlinked  to  BRCAl  may  be 
warranted. 

However,  for  many  of  the  observed  pedigrees,  there  are 
explanations  other  than  genetic  heterogeneity.  It  is  possi¬ 
ble  that  some  of  the  familial  clustering  is  due  to  chance.  It 
may  be  that  a  mutation  in  the  BRCAl  is  the  cause  of  most, 
but  not  all,  of  the  cases  of  breast  cancer  and  ovarian  cancer 
in  a  particular  family.  If  the  sporadic  case  is  young,  then 
there  need  to  be  several  affected  women  in  a  linked  family 
to  obtain  a  positive  LOD  score.  However,  if  the  sporadic 
case  is  postmenopausal,  there  will  be  little  effect  on  the 
observed  LOD  score,  because,  under  the  revised  model, 
little  weight  is  accorded  to  late-onset  breast  cancer  cases 
(data  not  shown).  Similarly,  a  family  could  be  a  hereditary 
site-specific  breast  cancer  family  unlinked  to  BRCAl  but 
with  a  single  sporadic  ovarian  cancer  case.  In  this  case,  the 
family  would  be  indistinguishable  from  a  breast-ovarian 
cancer  family.  A  higher  proportion  of  BRCAl  linkage  was 
observed  among  families  with  two  or  more  ovarian  cancers 
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Figure  3  Pedigree  CRC  128.  Symbols  are  as  in  figs.  1-2.  Ut  = 
endometrial  cancer.  Vertical  rectangles  are  used  to  indicate  the  most  fre¬ 
quent  haplotype  found  to  be  associated  with  breast  or  ovarian  cancer  in 
the  family,  but  they  are  not  intended  to  imply  that  a  BRCAl  mutation  is 
necessarily  present. 


(92%)  than  among  those  with  a  single  ovarian  cancer 
(81%).  This  is  not  unexpected,  given  the  possibility  of  spo¬ 
radic  cancers;  it  is  possible  that  our  data  set  contains  some 
site-specific  breast  cancer  families  with  a  single  case  of  spo¬ 
radic  ovarian  cancer  (Leiden  9  and  Iceland  4  are  potential 
examples  of  such  families).  The  probability  of  two  spo¬ 
radic  ovarian  cancers  in  a  single  family  is  more  remote.  It 
is  also  possible  that  on  other  chromosomes  there  exist 
genes  that  confer  a  high  risk  of  breast  cancer  and  a  low  risk 
of  ovarian  cancer. 

Genetic  recombination  is  an  alternate  explanation  for 
discordant  haplotypes,  but  the  possibility  is  remote  when 
double  crossovers  in  a  small  region  need  be  inferred.  Given 
the  density  of  the  markers  employed  here  to  construct 
haplotypes,  double  recombination  must  be  considered  to 
be  an  unlikely  possibility.  However,  in  some  families,  dis¬ 
tant  pedigree  branches  have  been  typed.  In  the  absence  of 
genotypic  information  for  intervening  relatives,  it  is  not 
possible  to  establish  if  several  single  crossovers  have  accu¬ 
mulated.  lARC  2850  is  a  potential  example  of  such  a  fam¬ 
ily  (fig.  1).  Similarly,  there  may  be  two  different  mutations 
of  BRCAl  that  are  segregating  in  the  same  family.  This 
possibility  must  also  be  considered  in  pedigrees  where  a 
number  of  distantly  related  relatives  are  affected  but  clini¬ 
cal  information  on  intervening  relatives  is  missing.  Because 
several  of  the  Utah  families  have  been  identified  by  merg¬ 
ing  cancer  registry  data  and  genealogical  information, 
these  families  potentially  could  represent  situations  either 
where  a  BRCAl  mutation  has  entered  the  pedigree  more 


than  once  or  where  a  new  BRCAl  mutation  has  occurred. 
Similarly,  there  may  be  unidentified  consanguinity,  and  a 
single  BRCAl  mutation  may  have  entered  a  pedigree 
through  two  individuals  thought  to  be  unrelated. 

An  interesting  possibility  is  that  there  is  a  second  breast 
cancer  susceptibility  gene  on  chromosome  17,  close  to  the 
region  spanned  by  the  markers  studied  here.  In  this  case, 
several  cases  of  cancer  in  a  family  will  appear  to  be  linked 
to  BRCAl,  as  will  one  or  two  additional  cases  that  appear 
to  be  sporadic  but  that  in  fact  represent  recombination 
between  the  BRCAl  locus  and  the  actual  susceptibility 
gene.  The  “sporadic”  cases  would,  on  average,  have  the 
same  early  age  of  diagnosis  as  do  the  other  genetic  cases. 

Errors  in  diagnosis,  mislabeling  of  specimens,  or  errors 
in  assigning  parentage  are  also  possible  explanations  for 
discordant  genotypes.  Spurious  evidence  against  linkage 
may  occur  if  the  cancer  diagnosis  is  wrong,  e.g.,  if  the  tu¬ 
mour  site  was  misspecified  or  if  the  cancer  was  metastatic 
to  the  ovary. 

It  is  possible  that,  in  some  cases,  ductal  carcinoma  in 
situ  may  have  been  coded  as  breast  cancer.  It  has  not  yet 
been  established  if  the  frequency  of  this  precancerous  con¬ 
dition  is  increased  among  carriers  of  BRCAl  mutations. 
This  diagnosis  is  increasingly  common  because  of  mam- 
mographic  screening  of  high-risk  families.  For  example, 
lARC  family  2090  is  a  linked  breast-ovarian  cancer  family 
with  two  cases  of  ovarian  cancer  and  five  cases  of  breast 
cancer  (fig.  5).  A  42-year-old  woman  in  this  family  (individ¬ 
ual  107),  who  does  not  carry  the  17q  haplotype  of  risk, 
was  recently  diagnosed,  through  screening,  as  having  duc¬ 
tal  carcinoma  in  situ.  Were  she  to  be  considered  affected, 
the  LOD  score  would  diminish  from  1.25  to  0.55.  It  will 
be  important  to  determine  the  extent  to  which  this  condi¬ 
tion  is  associated  with  BRCAl  mutations  in  these  families. 

It  is  equally  important  to  know  which  histologic  types  of 
ovarian  cancer  are  overrepresented  in  cancer  families.  In  the 
Gilda  Radner  Familial  Ovarian  Cancer  Registry,  cases  of 
mucinous  adenocarcinoma  and  cystadenocarcinoma  were 
underrepresented.  Only  6  (1.4%)  of  439  familial  ovarian 
cancers  were  mucinous,  in  comparison  with  12.7%  of  unse¬ 
lected  ovarian  cancers  in  the  SEER  database  (Fiver  et  al. 
1993).  There  were  no  mucinous  carcinomas  among  the  37 
hereditary  ovarian  cancers  reported  by  Bewtra  et  al.  (1992), 
among  11  familial  ovarian  cancers  reported  by  Greggi  et  al. 
(1990),  or  among  13  hereditary  ovarian  cancers  in  Ontario 
(Narod  et  al.  1994).  In  lARC  families  1812  and  2651,  the 
“unlinked”  ovarian  cancers  were  both  mucinous.  In  con¬ 
trast,  only  2  of  36  ovarian  cancers  among  BRCAl  carriers 
in  the  I  ARC  data  set  were  mucinous  (Narod  et  al.  1994). 
Nor  is  it  clear  if  borderline  ovarian  tumors  confer  familial 
risk.  Only  5  of  439  ovarian  cancers  in  the  Gilda  Radner 
registry  were  of  borderline  histology  (Fiver  et  al.  1993). 
Bewtra  et  al.  (1992)  found  2  of  37  hereditary  ovarian  can¬ 
cers  to  be  borderline.  Data  on  histologic  subtypes  of  epithe¬ 
lial  ovarian  cancer  are  not  routinely  available  on  all  consor- 
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Figure  5  Pedigree  IARC2090.  Symbols  are  as  in  figs.  1-3.  Pan 
^  pancreatic  cancer;  Ki  =  kidney  cancer;  and  Cx  =  cervical  cancer.  Indi¬ 
vidual  107  had  in  situ  carcinoma. 


tium  families,  and  it  will  be  important  to  extend  these  ob¬ 
servations  to  other  unlinked  families. 

In  conclusion,  we  believe  that,  in  addition  to  BRCAl, 
there  are  other  genes  that  predispose  to  both  cancer  of 
the  ovary  and  cancer  of  the  breast.  Although  the  overall 
evidence  favors  genetic  heterogeneity,  several  explana¬ 
tions  need  to  be  explored  before  a  particular  family  can  be 
classified  as  unlinked.  The  extent  of  heterogeneity  is  much 
reduced  for  the  subset  of  families  with  two  or  more  cases 
of  ovarian  cancer.  No  doubt  the  situation  will  be  clarified 
either  when  additional  susceptibility  loci  are  mapped  or 
when  BRCAl  is  identified  and  the  direct  search  for  muta¬ 
tions  in  families  becomes  possible. 
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Genetic  Heterogeneity  of  Breast-Ovarian  Cancer 
Revisited 

To  the  Editor: 

We  have  recently  reported  the  results  of  a  linkage  analy¬ 
sis  of  145  breast-ovarian  cancer  families  (Narod  et  al. 
1995).  Each  family  has  three  or  more  cases  of  early- 
onset  breast  cancer  (age  <60)  or  of  ovarian  cancer,  and 
all  families  have  at  least  one  case  of  ovarian  cancer  (there 
were  nine  site-specific  ovarian  cancer  families).  Overall, 
we  estimated  that  76%  of  the  families  were  linked  to 
the  BRCAl  locus. 

Among  the  145  families  studied,  there  were  only  10 
families  that  gave  multipoint  lod  scores  <-0.80,  with 
markers  flanking  BRCAl,  and  that  therefore  were  con¬ 
sidered  unlikely  to  be  linked  to  this  locus.  Since  the 
submission  of  our  report,  the  BRCAl  gene  has  been 
identified  (Miki  et  al.  1994),  and  the  BRCA2  has 
been  mapped  to  chromosome  13q  (Wooster  et 
Additional  studies  have  now  revealed  the  genetic  basis 
of  each  of  these  10  families— 3  families  were  found  to 
carry  BRCAl  mutations,  and  7  families  show  positive 


termination  at  codon  285.  Because  of  the  presence  of  a 
sporadic  case  of  breast  cancer  in  this  family,  at  age  35 
years,  the  lod  score  to  BRCAl  is  -0.91.  Family  RUL49 
is  a  Dutch  family  with  four  early-onset  breast  cancers, 
one  male  breast  cancer,  and  three  ovarian  cancers.  Re¬ 
cently  the  family  has  been  found  to  carry  a  BRCAl 
frameshift  mutation  leading  to  termination  at  codon  780 
(Hogervorst  et  al.  1995).  Because  of  a  sporadic  early- 
onset  breast  cancer,  the  BRCAl  lod  score  is  negative 
(-1.11).  This  is  a  rare  example  of  a  family  with  male 
breast  cancer  and  a  BRCAl  mutation.  A  second  example 
has  been  reported  by  Struewing  et  al.  (1995). 

The  other  seven  families  appear  to  be  linked  to 
BRCA2,  with  multipoint  lod  scores  ranging  from  0.93 
to  3.70  (table  1).  Five  of  these  families  contain  cases  of 
male  breast  cancer,  and  five  contain  only  a  single  case 
of  ovarian  cancer.  For  six  families  the  evidence  of  link¬ 
age  to  BRCA2  is  very  strong  (lod  scores  >1.6);  for  the 
other  family  (ICELAND  2204)  the  linkage  evidence  is 
more  modest. 

In  summary,  none  of  the  145  families  in  this  large  data 
set  now  provides  clear  evidence  against  both  linkage  to 
BRCAl  and  linkage  to  BRCA2.  These  results  indicate 
that  BRCAl  and  BRCA2  account  for  the  majority  of 
breast-ovarian  cancer  families.  Studies  are  now  under¬ 
way  that  will  estimate  the  proportions  of  breast  cancer 
families  attributable  to  each  of  these  loci.  It  remains 
possible  that  a  third  breast  cancer  locus  will  be  found, 
but  it  is  unlikely  that  this  locus  will  account  for  a  signifi¬ 
cant  proportion  of  families  with  breast  and  ovarian 

cancer. 
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linkage  to  the  BRCA2  locus.  ..  u 

I  ARC  2850  carries  a  BRCAl  mutation  leading  to  the 
loss  of  exon  5.  This  family  originally  gave  a  multipoint 
lod  score  of  -1.71  at  BRCAl,  but  it  can  now  be  seen 
that  the  family  contains  two  sporadic  cases  of  cancer 
and  that  two  recombination  events  have  ocoi^red  be¬ 
tween  the  flanking  markers  D17S250  and  D17S579.  The 
second  family  with  a  BRCAl  mutation  is  CRC  128. 
This  mutation  in  this  family  leads  to  premature  chain 
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Table  I 

Breast  Ovary  Families  with  Negative  Lod  Scores  to  BRCAI 


No.  OF  Cancers 


Breast 

Lod  Score  BRCAI  Lod  Score 

Family  Female  Male  Ovarian  TO  BRCAr  Mutation?*'  to  BRCA2*’’*' 


lARC  2850  .  15  0  2  -1.71  Yes 

RUL49 .  4  1  3  -1.11  Yes 

CRC  128  .  4  0  4  -.91  Yes 

CRC  186  .  16  1  1  -2.61  ...  3.70 

RUL9  .  7  0  1  -2.01  ...  1.64 

UTAH  107  .  32  3  6  -2.45  . . .  3.48 

UTAH  2044  .  7  1  4  -1.32  ...  2.11 

ICELAND  6  .  15  1  1  -.96  . . .  2.92 

ICELAND  2204  .  3  4  1  -.85  ...  .93 

ICELAND  80004  .  8  0  1  -1.89  .  . .  1.82 


Note. — The  linkage  model  is  presented  in  the  report  by  Narod  et  al.  (1995). 

Based  on  the  markers  D17S250  and  D17S579. 

^  An  ellipsis  indicates  that  the  test  was  not  done. 

"  Based  on  markers  D13S260,  D13SS289,  and  D13S267. 


Miki  Y,  Swensen  J,  Shattuck-Eidens  D,  Futreal  FA,  Harshman 
K,  Tavrigian  S,  Liu  Q,  et  al  (1994)  A  strong  candidate  gene 
for  the  breast  and  ovarian  cancer  susceptibility  gene  BRCAI. 
Science  266:66-71 

Narod  SA,  Ford  D,  Devilee,  P,  Barkardottir  RB,  Lynch  HT, 
Smith  SA,  Ponder  BAJ,  et  al  (1995)  An  evaluation  of  genetic 
heterogeneity  in  145  breast-ovarian  cancer  families.  Am  J 
Hum  Genet  56:254-264 

Struewing  JP,  Brody  LC,  Erdos  MR,  Kase  RG,  Giambarresi 
TR,  Smith  SA,  Collins  FS,  et  al  (1995)  Detection  of  eight 
BRCAI  mutations  in  10  breast/ovarian  cancer  families,  in¬ 
cluding  1  family  with  male  breast  cancer.  Am  J  Hum  Genet 
57:1-7 

Wooster  R,  Neuhausen  SL,  Mangion  J,  Quirk  Y,  Ford  D, 
Collins  N,  Nguyen  K,  et  al  (1994)  Localisation  of  a  breast 
cancer  susceptibility  gene  BRCA2,  to  chromosome  13ql2- 
13,  Science  265:2088-2090 

Address  for  correspondence  and  reprints:  Dr.  Steven  Narod,  Montreal  General 

Hospital,  Division  of  Medical  Genetics,  1650  Cedar  Avenue,  Room  LlO-107, 

Montreal,  Quebec  H3G  1A4,  Canada. 

©  1995  by  The  American  Society  of  Human  Genetics.  AH  rights  reserved. 

0002-9297/95/5704-002752.00 


Am./.  Hum.  Genet.  57:958-960,  1995 

CFTR  Gene  Variant  for  Patients  with  Congenital 
Absence  of  Vas  Deferens 

To  the  Editor: 

Obstructive  azoospermia  due  to  congenital  absence  of 
vas  deferens  is  a  prominent  clinical  feature  among  male 


patients  with  cystic  fibrosis  (CF)  (Flolsclaw  et  al.  1971). 
A  similar  autosomal  recessive  condition  with  no  other 
CF  manifestations  is  classified  as  congenital  bilateral  ab¬ 
sence  of  vas  deferens  (CBAVD;  McKusick  277180; 
Schellen  and  van  Stratten  1980).  Since  50% -64%  of 
CBAVD  patients  have  been  found  to  be  positive  for  at 
least  one  known  CFTR  mutation,  it  is  believed  that  at 
least  part  of  the  CBAVD  population  represents  an  atypi¬ 
cal  form  of  CF  affecting  only  the  male  reproductive  sys¬ 
tem  (Dumur  et  al.  1990;  Anguiano  et  al.  1992;  Gervais 
et  al  1993;  Osborne  et  al.  1993;  Patrizio  et  al.  1993; 
Culard  et  al.  1994).  This  explanation  is  not  completely 
satisfactory,  however,  because  only  ~10%  of  CBAVD 
patients  are  found  to  carry  known  CF  mutations  on 
both  chromosomes,  even  after  exhaustive  screening  of 
the  entire  CFTR  coding  region.  Here  we  present  data  to 
show  that  a  previously  known  sequence  variant  in  intron 
8  of  the  CFTR  gene  (Chu  et  al.  1993)  is  a  specific  and 
frequent  mutation  associated  with  CBAVD. 

Varied  lengths  of  a  thymidine  (T)-tract  (5,  7,  or  9T) 
have  been  noted  in  front  of  the  splice-acceptor  site  of 
intron  8  (Chu  et  al.  1993).  The  length  appears  to  corre¬ 
late  with  the  efficiency  of  exon  9  splicing,  with  the  5T 
variant  that  is  present  in  5%  of  the  CFTR  alleles  among 
the  Caucasian  population  producing  almost  exclusively 
(95%)  exon  9-minus  mRNA  (Chu  et  al.  1993).  The 
effect  of  this  T-tract  polymorphism  in  CFTR  gene  ex¬ 
pression  is  also  documented  by  its  relationship  with  a 
known  CF  mutation  R117H  (Dean  et  al.  1990):  While 
R117H(5T)  is  found  in  typical  CF  patients  with  pancre¬ 
atic  sufficiency,  R117H(7T)  is  associated  with  CBAVD 
(Kiesewetter  et  al.  1993),  More  recently,  CFTR  alleles 
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SUMMARY  The  majority  of,  but  not  all,  women  with  mutations  in 
the  BRCAl  gene  will  be  affected  with  breast  or  ovarian  cancer  by 
the  age  of  70.  To  establish  whether  known  risk  factors  modify 
susceptibility  to  cancer  in  these  women  we  have  studied  the 
reproductive  histories  of  333  North  American  women  who  were  found 
by  haplotype  analysis  to  carry  BRCAl  mutations.  An  increased 
risk  for  breast  cancer  was  associated  with  low  parity  and  with 
recent  birth  cohort.  The  risk  of  ovarian  cancer  decreased  with 
increasing  age  at  last  childbirth;  however,  in  contrast  to  the 
case  for  sporadic  cancer,  the  risk  of  ovarian  cancer  in  BRCAl 
carriers  was  found  to  increase  significantly  with  increasing 
parity. 


INTRODUCTION 


The  BRCAl  gene  on  chromosome  region  17ql2  predisposes  to  both 
breast  and  ovarian  cancer  (Hall  et  al,  1990,  Narod  et  al,  1991, 
Miki  et  al,  1994.  It  has  been  estimated  that  between  1  in  200 
and  1  in  2000  women  carry  mutations  in  this  gene  (Easton  et  al, 
1994)  .  Carriers  of  BRCAl  mutations  in  breast-ovary  cancer 
families  can  be  identified  by  performing  linkage  analysis  using 
polymorphic  markers  from  chromosome  17q  (e.g.  Easton  et  al,  1993; 
Narod  et  al,  1995)  or  directly  by  genetic  sequencing  of 
constitutional  DNA  (Miki  et  al,  1994;  Simard  et  al,  1994; 
Friedman  et  al,  1994;  Castilla  et  al,  1994;  Shattuck-Eidens  et 
al,  1995) . 


The  clinical  expression  of  BRCAl  appears  to  vary  from  family 
to  family  (Easton  et  al,  1994).  These  differences  may  prove  to 
be  due  in  part  to  different  mutant  BRCAl  alleles  associated  with 
different  degrees  of  penetrance.  The  majority  of  known  carriers 
to  date  have  been  identified  in  large  kindreds  with  multiple 
affected  members.  It  has  been  suggested  that  these  extended 
pedigrees  may  be  atypical,  and  that  BRCAl  mutations  with  less 
profound  effects  may  be  more  common  (Rowell  et  al,  1994) .  Within 
a  family  it  is  also  common  to  see  different  effects  of  a  BRCAl 
mutation;  some  women  may  be  affected  with  breast  cancer  in  their 
thirties,  whereas  others  may  remain  unaffected  into  their 
seventies.  There  are  several  alternate  explanations  for  the 
observed  variability  within  families,  including:  1)  the  random 
accumulation  of  mutations  in  other  critical  genes;  2)  possible 
unidentified  environmental  factors;  3)  the  modification  of  cancer 
risk  by  pregnancy  or  by  the  use  of  exogenous  hormones;  and  4)  the 
influence  of  allelic  variation  of  other  genes  on  the  BRCAl 
phenotype . 


The  discovery  of  BRCAl  modifiers  may  clarify  our 
understanding  of  the  mode  of  action  of  this  gene  and  may  lead  us 
to  refine  our  current  models  of  hormonal  carcinogenesis.  It  is 
also  important  to  identify  non-genet ic  risk  modifiers  in  order  to 
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counsel  BRCAl  mutation  carriers  about  potential  means  of  reducing 
their  risk.  We  have  constructed  a  historical  cohort  of  333 
female  carriers  of  BRCAl  mutations  in  four  centers  in  North 
America  and  have  studied  the  effects  of  reproductive  and  other 
variables  on  breast  and  ovarian  cancer  incidence. 

SUBJECTS  AND  METHODS 


Identification  of  families  Families  were  identified  through  four 
clinical  centers  in  North  America  where  BRCAl  mutations  analysis 
is  currently  offered.  All  families  contained  cases  of  ovarian 
cancer  in  addition  to  multiple  cases  of  breast  cancer.  No  family 
contained  a  case  of  male  breast  cancer.  It  is  estimated  that  88% 
of  such  families  are  linked  to  BRCAl  (Narod  et  al,  1995) .  All 
families  had  lod  scores  of  0.5  or  greater  with  linkage  to  BRCAl, 
using  either  flanking  markers  (D17S250,  THRAl,  D17S579,  D17S588) 
or  the  intragenic  marker  D17S855.  Details  of  the  markers  used 
are  provided  in  Miki  et  al  (1994)  and  Easton  et  al  (1993).  A 
lod  score  of  0.50  corresponds  to  a  posterior  probability  of 
linkage  of  95%,  but  for  most  families  the  probability  of  linkage 
to  BRCAl  was  much  higher  than  this.  Mutations  in  BRCAl  have  now 
been  identified  for  21  of  the  28  families,  including  all  3  of  3 
Utah  families,  4  of  6  McGill  families  2  of  3  Pennsylvania 
families,  (Simard  et  al,  1994;  Friedman  et  al,  1994;  Shattuck- 
Eidens  et  al,  1995)  and  12  of  16  Creighton  families  (unpublished 
data)  . 


Identification  of  BRCAl  carriers  Individuals  were  considered  to 
be  carriers  in'^he  families  by  the  inspection  of  segregating 
haplotypes.  Women  for  whom  haplotype  data  was  not  available  were 
excluded  from  the  analysis.  In  some  cases,  haplotypes  were 
constructed  by  inference  using  marker  data  derived  from  children 
and  spouses.  Women  were  considered  to  be  carriers  of  BRCAl 
mutations  if  they  inherited  the  allele  of  risk  for  the  intragenic 
marker  D17S855.  If  D17S855  was  not  typed,  or  was  uninformative, 
the  flanking  markers  were  used  to  establish  the  carrier  status. 
If  there  was  evidence  for  a  recombination  event  between  the 
flanking  markers  THRAl  and  D17S579,  and  the  position  of  the 
recombinant  could  not  be  resolved  with  respect  to  D17S855,  the 
BRCAl  status  was  considered  to  be  ambiguous,  and  the  case  was 
excluded  from  the  analysis.  Within  sibships  BRCAl  testing  was 
was  independent  of  disease  status. 

A  total  of  333  BRCAl  carrier  women  were  identified  through 
haplotype  analysis  of  28  breast-ovary  cancer  families  using 
chromosome  17  markers.  Of  these,  169  women  had  been  diagnosed 
with  breast  cancer  by  1994  (mean  age  of  diagnosis  40.9  years),  72 
had  ovarian  cancer  (53.4  years)  and  25  women  had  cancers  of  both 
types  (table  I) .  98  women  are  currenty  alive  and  unaffected 
(mean  age  38.4  years)  and  19  have  died  of  another  cause.  The 
years  of  birth  of  the  women  ranged  from  1859  to  1973.  Women 
were  followed,  on  average,  until  the  age  of  50.0  years.  The 
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ratio  of  cases  of  breast  cancer  to  ovarian  cancer  was  2.3  to  1 
overall;  this  ratio  varied  from  1.90  (McGill)  to  3.83 
(Pennsylvania) . 


Information  on  reproductive  history  was  routinely  obtained 
on  living  study  subjects  at  the  time  blood  was  drawn  for  the 
genetic  analysis,  either  by  written  questionnaire  or  by  telephone 
interview.  This  information  included  their  year  of  birth  and  the 
number  and  ages  of  all  pregnancies.  Information  for  the  117 
deceased  women  was  obtained  from  a  close  relative,  usually  a 
sister  or  a  daughter.  Age  of  menarche  was  only  available  for  the 
McGill  and  Creighton  families. 


Statistical  analysis  The  data  was  analysed  by  survival  analysis 
methods .  Individuals  were  considered  to  be  at  risk  of  breast  or 
ovarian  cancer  from  the  age  of  twenty  until:  1)  the  development 
of  breast  or  ovarian  cancer;  2)  death  from  another  cause;  3) 
prophylactic  surgical  removal  of  the  breasts  or  ovaries; 
September  1994.  The  survival  curves  were  constructed  using 
Kaplan-Meir  estimates.  Differences  in  Kaplan-Meir  curves  were 
tested  for  statistical  significance  using  the  log-rank  test. 
The  effects  of  covariates  on  survival  was  modelled  using  Cox 
regression  assuming  a  proportional  hazards  model  (Cox,  1972) 
Nulliparous  women  were  excluded  from  the  univariate  analyses  of 
estimates  of  the  effect  of  age  of  first  birth  and  age  of  last 
birth.  When  multivariate  analysis  included  variables  for  both 
age  of  first  (or  last)  birth  and  parity,  the  age  of  first  (or 
last)  birth  was  coded  as  zero  for  nulliparous  women. 


RESULTS 


The  women  in  this  study  represent  a  historical  cohort  of  339 
identified  BRCAl  mutation  carriers  from  28  families,  each  with  a 
minimum  of  one  case  of  ovarian  cancer  and  three  cases  of  breast 
cancer.  The  age-specific  incidence  rates  of  breast  and  ovarian 
cancer  in  this  cohort  are  presented  in  figure  1.  The  maximal 
breast  cancer  incidence  was  observed  in  the  age  group  41  -  45 
years.  Breast  cancer  rates  declined  modestly  thereafter.  The 
ovarian  cancer  incidence  was  relatively  constant  above  the  age  of 
45,  and  was  greatest  between  ages  51  and  55.  For  women  oyer 
the  age  of  50  the  annual  risk  of  ovarian  cancer  exceeded  the  risk 
for  breast  cancer.  The  cumulative  incidence  of  breast  cancer  was 
52.2%  to  age  50  (95%  confidence  interval;  45.9%  -  58.4%)  and 
74.6%  to  age  70  (67.0%  -  82.2%);  for  ovarian  cancer  these  figures 
were  19.5%  to  age  50  (12.5%  -  24.5%)  and  56.3%  to  age  70  (45.9%  - 
66.7%)  . 

When  each  of  the  predictive  variables  was  considered  on  its 
own,  increased  risks  of  breast  cancer  were  associated  with 
menarche  below  the  age  of  12  years  (relative  risk  =  1.57;  p  = 
0.053),  with  parity  of  less  than  three  (relative  risk  =  2.04;  p  < 
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0.001)  and  with  year  of  birth  after  1930  (relative  risk  =  2.72;  p 
<  0.001).  The  cohort  effect  was  surprisingly  strong.  It  is 
estimated  that  42.4%  of  women  in  the  recent  cohort  will  be 
diagnosed  with  breast  cancer  by  age  40,  as  compared  to  12%  for 
women  born  before  1930. 


Because  of  the  possibility  that  the  observed  cohort  effect 
may  be  biased  by  the  means  of  ascertainment  of  the  pedigrees,  the 
analysis  was  repeated,  but  restricted  to  the  largest  single 
pedigree.  Utah  kindred  2082  was  ascertained  by  linkage  of  the 
Utah  Genealogy  to  the  Utah  cancer  registry,  based  on  a  cluster  of 
two  breast  and  four  ovarian  cancers,  and  was  independent  of  age- 
of  onset  of  the  breast  cancers.  The  relative  risk  in  this  single 
pedigree  associated  with  recent  cohort  was  3.84  (p  =  0.013). 
Because  of  the  large  size  of  this  pedigree  (25  cases  of  breast 
cancer  and  21  cases  of  ovarian  cancer)  the  penetrance  figures  are 
unlikely  to  be  influenced  unduly  by  the  effect  of  inclusion  or 
exclusion  of  a  small  number  of  individuals. 

The  penetrance  of  breast  cancer  was  not  different  for  BRCAl 
carriers  who  inherited  the  mutation  from  their  father  as  for 
those  who  inherited  it  from  their  mother  (p  =  0.96). 

Because  of  the  effect  of  year  of  birth  on  penetrance,  and 
because  the  risk  for  breast  cancer  varied  between  centers,  the 
effects  of  the  reproductive  variables  were  studied  while 
adjusting  for  study  center  and  for  birth  cohort.  Univariate  and 
multivariate  relative  risks  are  presented  in  table  II.  Age  of 
first  birth,  age  of  last  birth  and  parity  are  inter-related,  and 
a  series  of  alternate  models  was  evaluated  in  order  to  eliminate 
superfluous  variables.  In  no  model  were  age  of  first  birth  or 
age  of  last  birth  found  to  be  independently  predictive  of  breast 
cancer  risk,  and  both  these  variables  could  be  eliminated.  In 
contrast,  increasing  parity  was  found  to  be  protective  for  breast 
cancer  (table  III) .  A  relative  risk  of  0.85  was  estimated  for 
each  additional  birth  up  to  five  or  more  (p  =  0.002).  The 
cumulative  incidence  of  breast  cancer  in  women  with  two  or  fewer 
births  is  compared  to  the  incidence  in  women  with  three  or  more 
births  in  figure  2.  There  were  notable  trends  in  fertility 
patterns  within  this  cohort.  Women  born  after  1930  had,  on 
average,  fewer  births  (2.37  vs.  4.95)  and  an  earlier  average 
age  of  last  birth  (28.5  vs  34.5  years)  than  women  born  before 
1930.  However,  the  magnitude  of  the  cohort  effect  was  not 
diminished  by  adjustment  for  the  reproductive  variables  (table 
II)  . 

The  best-fitting  model  for  predicting  ovarian  cancer  risk  also 
included  parity,  but  the  direction  of  the  risk  associated  with 
multiple  births  was  opposite  to  that  found  for  breast  cancer. 
Each  birth  conferred  an  additional  40%  increase  in  risk  (relative 
risk  =1.40;  p  =  0.005)  for  up  to  five  births.  Although  women 
with  multiple  parity  were  at  increased  ovarian  cancer  risk, 
there  was  also  a  protective  effect  of  a  late  birth.  When 
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analysed  as  a  continuous  variable  in  the  multivariate  model, 
each  five-year  interval  in  age  of  last  birth  was  associated  with 
a  risk  reduction  of  18%  (relative  risk  =  0.82;  p  =  0.03). 
Compared  with  women  who  last  gave  birth  at  age  29  or  earlier, 
those  with  a  last  birth  at  age  30  or  above  were  at  48%  less  risk. 
Women  who  had  all  their  children  over  the  age  of  thirty,  or  who 
were  nulliparous  formed  a  low  risk  group  for  ovarian  cancer 
(relative  risk  =  0.30,  p  =  0.03). 

As  was  the  case  for  breast  cancer,  the  risk  for  ovarian 
cancer  was  much  higher  in  the  recent  birth  cohort  (relative  risk 
=  2.08  for  women  born  after  1930;  p  =  0.007).  To  eliminate  the 
possible  confounding  effect  of  oral  contraceptive  use  in  recently 
born  women  (oral  contraceptives  are  associated  with  low  parity 
and  with  reduced  ovarian  cancer  risk) ,  to  minimize  the  cohort 
effect,  and  to  dissociate  the  effect  of  ovarian  cancer  on 
subsequent  parity,  the  survival  analysis  was  repeated,  but  was 
restricted  to  the  subgroup  of  women  who  were  born  before  1930, 
and  who  were  cancer-free  at  age  35  (figure  3) .  The  results  of 
the  multivariate  model  in  this  restricted  analysis  were 
essentially  unchanged  -  high  parity  continued  to  be  predictive  of 
ovarian  cancer  risk  (relative  risk  =  1.95  for  each  additional 
birth  up  to  five;  p  =  0.00028)  and  late  birth  was  also  protective 
in  this  subgroup  (relative  risk  =  0.68  per  five  year  interval;  p 
=  0.005). 


DISCUSSION 


Several  observations  in  this  historical  cohort  suggest  the 
interaction  of  environmental  and  genetic  factors,  including  the 
incomplete  penetrance  of  the  BRCAl  gene  and  the  trend  towards 
earlier  diagnosis.  Reproductive  factors  do  not  appear  to  be 
sufficient  to  explain  the  observed  variability  in  gene  expression 
and  penetrance. 

In  our  study  we  estimated  a  risk  of  75%  to  age  70  for  breast 
cancer  and  56.3%  for  ovarian  cancer.  It  is  perhaps  surprising 
that  the  age-specific  risks  for  ovarian  cancer  exceed  the  risks 
for  breast  cancer  in  BRCAl  carriers  above  the  age  of  50. 
Because  each  family  was  selected  to  contain  at  least  one  case  of 
ovarian  cancer,  these  penetrance  estimates  are  pertinent  to 
families  with  the  breast-ovarian  cancer  syndrome.  The  total 
number  of  breast  cancers  outnumber  ovarian  cancers  by  2.3  to  1 
overall,  but  the  majority  (81.6%)  of  cases  of  breast  cancer 
occur  before  the  age  of  50,  and  many  of  these  women  did  not 
survive  the  initial  cancer.  Also,  we  did  not  consider  women  to 
be  at  risk  for  ovarian  cancer  after  oophorectomy.  Oophorectomy 
is  used  as  a  prophylactic  measure  and  may  also  be  used  in  the 
treatment  of  breast  cancer.  Our  risks  are  similar  to  those  of 
the  Breast  Cancer  Linkage  Consortium  (BCLC)  who  estimate  risks  of 
87%  for  breast  cancer,  and  44%  for  ovarian  cancer  to  age  70 
(Easton  et  al,  1994;  Easton  et  al,  1995).  Our  estimate  of 
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ovarian  cancer  penetrance  is  higher,  but  each  of  our  families  was 
selected  because  it  contains  at  least  one  case  of  ovarian  cancer. 
BRCAl  mutations  have  also  been  identified  in  families  with  site- 
specific  breast  cancer,  but  in  our  experience  this  group 
represents  a  minority  of  the  families  with  mutations. 


Goldgar  et  al  (1994)  found  that  the  sex  of  the  carrier 
parent  in  family  2082  was  significantly  associated  with  the  type 
of  cancer  observed  in  the  child  (breast  versus  ovarian) .  We 
found  no  influence  of  the  sex  of  the  transmitting  parent  on  the 
penetrance  of  either  type  of  cancer  in  our  extended  series  of 
pedigrees . 

In  this  cohort  of  BRCAl  carriers  the  risk  of  cancer  of  either 
type  was  much  higher  in  women  born  after  1930.  The  effect  was 
equally  strong  using  cut-off  years  of  1920  (i.e.  relative  risk 
of  2.68  for  breast  cancer)  or  of  1940  (relative  risk  3.06  for 
breast  cancer) .  We  do  not  believe  that  the  trend  towards 
earlier  diagnosis  can  be  explained  by  better  screening  in  the 
recently  born  women.  It  is  our  experience  that  most  tumors  have 
been  identified  by  the  detection  of  a  palpable  mass  by  the 
patient  herself  or  by  her  physican.  There  has  been  no 
improvement  in  life  expectancy  consistent  with  extensive 
screening;  in  the  earlier  cohort  survival  was  80.0%  to  age  50, 
and  in  the  recent  cohort  suvival  to  age  50  is  74.9%. 
Mammography  data  was  not  routinely  available  for  the  women  in  all 
families,  and  this  will  be  the  subject  of  a  future  prospective 
study . 

Bias  in  the  ascertainment  of  pedigrees  could  lead  to  spurious 
cohort  effects,  in  particular  if  recently  diagnosed  early-onset 
cases  were  to  lead  to  the  selective  inclusion  of  families  in  the 
study.  We  do  not  feel  that  our  sampling  strategy  has  produced  a 
biased  result  for  several  reasons.  The  penetrance  of  breast 
cancer  was  significantly  greater  in  the  recent  cohort  in  all  four 
centers.  The  cohort  effect  was  clearly  present  in  the  single 
large  Utah  pedigree.  Kindred  2082  (see  results) .  The  Creighton 
families  have  been  ascertained  over  the  past  25  years  and  many  of 
the  recent  breast  cancer  cases  were  diagnosed  following  the 
original  ascertainment.  Ten  of  the  20  cases  of  cancer  in  the 
largest  family  (kindred  1816)  were  diagnosed  since  1978,  at 
which  time  the  pedigree  was  ascertained.  A  spurious  temporal 
effect  could  also  result  if  women  were  preferentially  included  in 
the  recent  cohort  because  they  were  affected.  All  women  at  risk 
in  these  28  families  were  asked  to  participate  in  the  genetic 
study,  whether  or  not  they  were  affected  with  cancer,  and  in  most 
cases,  the  sampling  of  women  in  the  relevant  sibships  was 
complete.  Only  women  for  whom  DNA  testing  revealed  the  BRCAl 
carrier  haplotype,  directly  or  by  inference,  were  included.  The 
proportion  of  subjects  in  the  recent  cohort  who  were  affected 
(57.1%)  was  in  fact  lower  than  the  proportion  of  affected  women 
in  the  earlier  cohort  (79.2%).  Preferential  sampling  of  affected 
women  should  lead  to  an  overestimate  of  penetrance.  Our 
estimates  of  breast  cancer  penetrance  is  slightly  lower  than  the 
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BCLC  figure,  which  was  derived  using  other  methods  (Eastone  et 
al,  1995) .  The  reason  for  the  observed  cohort  effect  is 
unclear,  but  does  not  seem  to  be  explainable  by  the  trends  in 
fertility.  Similar  patterns  of  increasing  risk  in  families  with 
dominant  cancer  syndromes  have  been  reported  for  familial 
malignant  melanoma  (Battistutta  et  al,  1994)  and  the  Li-Fraumeni 
syndrome  (L.  Strong,  personal  communication) . 

The  risk  of  breast  cancer  declined  with  increasing  parity  in 
BRCAl  carriers,  in  keeping  with  studies  of  breast  cancer  in  the 
general  population  (Brinton  et  al,  1982;  Rosneret  al,  1994)  but 
an  early  age  of  first  birth  did  not  confer  additional  protection. 
Giving  birth  at  a  young  age  has  been  observed  to  protect  against 
breast  cancer  in  the  general  population  in  several  studies 
(Brinton  et  al,  1982;  Rosner  et  al,  1994). 

Alternate  models  proposed  to  explain  the  epidemiology  of 
sporadic  ovarian  cancer  include  an  increased  risk  from  incessant 
ovulation  or  from  prolonged  exposure  to  raised  levels  of 
gonadotropins  (Whittemore  et  al,  1992) .  Neither  of  these  models 
is  consistent  with  the  pattern  we  observe  in  hereditary  ovarian 
cancer.  Pregnancy  is  associated  with  a  reduction  in  the  number 
of  ovulations  and  with  decreased  gonadotropin  levels,  but  is 
associated  with  an  increased  risk  of  ovarian  cancer  in  BRCAl 
carriers.  The  risk  of  sporadic  ovarian  cancer  is  reduced  after 
giving  birth  (Adami  et  al,  1994).  The  protection  offered  by  a 
late  birth  in  BRCAl  carriers  is  consistent  with  the  hypothesis 
of  pregnancy-dependent  clearance  of  malignant  ovarian  cells 
raised  by  Adami  et  al  (1994).  Multiparity  is  protective 
against  sporadic  ovarian  cancer  (Adami  et  al,  1994)  but  appears 
to  be  a  risk  factor  in  BRCAl  carriers.  In  a  previous  study  of 
the  Utah  population  database  (Kerber  and  Slattery,  1995)  it  was 
found  that  giving  birth  to  six  or  more  children  did  not  protect 
against  familial  ovarian  cancer  (odds  ratio  =  1.11),  but  was 
strongly  protective  against  non-familial  ovarian  cancer  (odds 
ratio  =  0.29). 


Oral  contraceptive  use  reduces  the  risk  of  ovarian  cancer  in 
the  general  population  (Parazzinni  et  al,  1991)  and  deserves 
consideration  as  a  potential  preventive  therapy,  but  it  is  not 
yet  clear  if  the  effect  of  oral  contraceptives  on  BRCAl- 
associated  ovarian  cancer  will  be  in  the  same  direction.  It  is 
of  equal  importance  to  establish  if  the  use  of  oral 
contraceptives  influences  the  incidence  of  breast  cancer  in  BRCAl 
carriers,  especially  in  light  of  our  unexplained  observation  of 
increased  rates  of  cancer  among  recent  generations  of  BRCAl 
carriers . 


Genetic  counselling  and  risk  assessment  by  using  linked 
markers  and  by  direct  mutation  detection  is  now  offered  in  our 
centers  and  elsewhere.  It  is  common  for  women  to  ask  about  the 
impact  of  the  known  risk  factors  on  hereditary  breast  and  ovarian 
cancer,  and  how  they  may  modify  their  personal  risk.  The 
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absolute  risk  of  breast  cancer  in  these  women  declined  after  the 
time  of  the  menopause,  suggesting  that  ovarian  hormones  may  be 
important  in  promoting  tumour  development  in  BRCAl  carriers.  The 
effect  of  prophylactic  oophorectomy  or  of  tamoxifen  use  in 
preventing  BRCAl-associated  breast  cancer,  and  the  safety  of 
hormone  replacement  therapy  deserve  particular  attention  in 
controlled  follow-up  studies.  Our  results  indicate  that  it  is 
inadequate  to  extrapolate  from  observations  of  sporadic  cancers 
and  that  studies  of  the  hereditary  subgroups  are  required. 
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Table  I.  Families  used  in  the  Analysis 


Centre 


Families 


Total  Number 

Carriers  with 

Breast 

Cancer 


Number  Unaffected 

with  Carriers 

Ovarian 
Cancer 


Creighton 

16 

162 

87 

36 

53 

Utah 

3 

103 

40 

19 

47 

McGill 

6 

34 

19 

10 

8 

Pennsylvania 

3 

34 

23 

7 

9 

TOTAL 

28 

333 

169 

72 

117 
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Table  I ,  Continued 


Average  age  of 
last  birth 


Parity  Average  age  of 
first  birth 


Creighton 

2.93 

22.9 

30.0 

Utah 

3.94 

23.1 

32.1 

McGill 

2.67 

23.1 

31.6 

Pennslyvania 

3.03 

22.1 

28.8 

TOTAL 

3.22 

22.9 

30.7 
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Table  II.  Cox  proportional  hazards  modelling  of  the  effect  of 
reproductive  variables  on  breast  cancer  incidence;  complete 
model . 


Variable 

Cancer 

Univariate  p 

Multivariate  p 

Relative 

Relative 

Risk 

Risk 

COHORT 

Breast 

2.75 

<0.001 

2.53 

<0.001 

Ovary 

2.37 

0.003 

2.17 

0.01 

AGE  OF  FIRST 

Breast 

0.94 

0.43 

0.98 

0.85 

BIRTH 

Ovary 

0.64 

0.007 

1.18 

0.32 

AGE  OF  LAST 

Breast 

0.76 

<0.001 

1.02 

0.85 

BIRTH 

Ovary 

0.70 

0.003 

0.73 

0.03 

PARITY 

Breast 

0.78 

<0.001 

0.84 

0.06 

Ovary 

1.05 

0.54 

1.47 

0.003 

Definitions  of  variables:  COHORT:  born  after  1930  versus  born 
before  or  in  1930;  AGE  OF  FIRST  BIRTH:  Relative  risks  for  each 
five-year  increment  in  age  of  first  birth;  AGE  OF  LAST  BIRTH: 
Relative  risks  for  each  five-year  increment  in  age  of  last  birth; 
PARITY:  Relative  risks  are  for  each  additional  birth  up  to 
five,  compared  to  nulliparous  women.  Relative  risks  are  adjusted 
for  other  variables  in  the  table  as  well  as  for  study  centre. 
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Table  III.  Cox  proportional  hazards  modelling  of  the  effect  of 
reproductive  variables  on  breast  cancer  incidence:  final  model. 


Variable 

Cancer 

Multivariate 

Relative 

Risk 

p-value 

COHORT 

Breast 

2.42 

<0.001 

Ovary 

2.07 

0.007 

AGE  OF  LAST 

Breast 

**** 

*  *  * 

BIRTH 

Ovary 

0.82 

0.03 

PARITY 

Breast 

0.85 

0.002 

Ovary 

1.40 

0.005 

Definitions  of  variables  as  in  table  II.  Relative  risks  are 
adjusted  for  other  variables  in  the  table  and  for  study  center. 
*****  Age  of  last  birth  was  not  included  in  the  model  for  breast 
cancer. 
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LEGENDS  FOR  FIGURES 


Figure  1.  Incidence  of  breast  and  ovarian  cancer  in  cohort  of 
BRCAl  carriers. 

Figure  2.  Influence  of  parity  on  breast  cancer  incidence. 


Figure  3.  Influence  of  parity  on  ovarian  cancer  incidence  for 
women  born  before  1930.  All  women  cancer-free  at  age  35. 
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BRCA1  a  breast  and  ovarian  cancer  susceptibility  locus,  has  been  isolated  and  maps  to  17q21.  A  physical  map 

repeat  ,STR,  PC, 

sS  was Trther  rocallzed  to  a’reglon  boa’nded  by  D17S132,  on  ,he  proximal  “ 

Side  Within  this  estimated  600  kb  region,  the  contig  was  composed  comple  ely  of  Pis  and  BACs  orde  e  y 
STS-content  mapping  and  confirmed  by  DNA  restriction  fragment  fingerprinting.  _ 


INTRODUCTION 

The  isolation  of  genes  through  positional  cloning  requires  a 
minimal  region  containing  the  gene  of  interest,  usually  defin^ 
by  identification  of  key  recombinant  meiotic  breakpoints  in 
families,  and  a  detaded  physical  map  composed  of  cloned 
segments  of  DNA  which  span  the  region.  A  general  strategy  to 
clone  and  map  large  regions  of  human  DNA  using  STS-content 
mapping  was  demonstrated  for  the  region  of  chromosome  7 
containing  the  cystic  fibrosis  gene  (1).  Huntington’s  disease  (HD) 
provides  an  example  of  the  creation  and  utilization  of  a  detailed 
physical  map  to  isolate  a  gene  by  positional  cloning.  Initially, 
physical  maps  of  the  defined  genetic  region  were  constructed 
using  yeast  artificial  chromosome  (YAC)  clones  (2,3).  The 
presence  of  deletions  and  chimerisms  and  the  large  DNA  insert 
size  of  these  vectors  made  it  clear  that  smaller  clones  would  be 
more  useftil  for  positional  cloning.  Subsequendy,  within  this 
mapped  region,  cosmid  contigs  (4)  and  cosmd/Pl  contigs  (5) 
were  developed  which  allowed  for  the  identification  of  several 
genes  in  the  region,  as  well  as  the  identification  of  the  HD  gene 

itself  (6).  ,  . 

The  most  recent  success  of  a  positional  cloning  strategy  is  the 
isolation  of  BRCAl,  a  gene  for  familial  ovarian  and  early-onset 
breast  cancer  f7).  The  search  for  BRCAl  began  in  earnest  in  1990 
with  a  report  of  genetic  linkage  to  a  small  region  on  the  long 
arm  of  chromosome  17  (8).  Based  on  multipoint  linkage  analysis, 
BRCAl  was  inidally  localized  to  the  interval  between  D17S250 


and  D17S579  (9).  Subsequendy,  addidonal  recombinants  were 
found  in  linked  kindreds  which  refined  the  proximal  boundary 
of  the  BRCAl  region  to  D17S776  (10)  and  the  distal  boundary 
to  D17S78  (11).  Recendy,  a  physical  map  of  the  BRCAl  region 
from  ERBB2  to  D17S509  was  published  which  was  comprised 
primarily  of  YACs  (12). 

As  part  of  our  effort  to  isolate  the  BRCAl  gene,  the  strategy 
was  to;  1)  create  a  physical  map  of  the  large  defined  BRCAl 
region;  2)  isolate  polymorphic  short  tandem  repeat  markers 
(STOs)  from  these  inidal  physical  resources;  3)  use  these  STRs 
in  our  set  of  linked  kindreds  which  contained  informadve 
recorabinadon  events  to  refine  the  region  containing  BRCAl;  ^d 
then  4)  construct  a  finer  and  more  detailed  physical  map  in  this 
narrower  region.  This  strategy  proved  to  be  a  key  factor  in  our 

successful  isoladon  of  BRCAl . 

In  this  report,  we  present  our  preliminary  physical  map  ot  me 
inidal  region  from  D17S776  to  D17S78,  detail  the  genedc  evenu 
which  localized  BRCAl  to  a  smaller  region,  and  show  me 
Pl-based  physical  map  of  this  smaller  region. 


RESULTS 

In  order  to  develop  a  physical  map  of  the  refined  BRCAl  region 
as  quickly  and  efficiently  as  possible,  we  initiated  screens  tor 
YACs  and  Pis  at  four  starting  points  whose  reladve  locations 
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were  known.  The  map  was  constructed  by  utilizing  STS  markers 
in  the  region,  isolating  clones,  and  then  walking  from  the  ends 
of  the  developing  contigs  using  STSs  developed  from  those  ends. 
The  STS-content  data  for  all  STSs  tested  against  each  of  the 
cloned  DNA  segments  is  shown  in  Figure  1.  To  rapidly  merge 
contias,  we  screened  initially  for  YACs  as  their  insert  sizes  are 
laraer  than  BACs,  Pis,  and  cosmids.  At  least  10  of  25  YACs 
identified  in  this  study  appeared  to  be  chimeric  or  contained 
deletions,  so  we  independently  covered  the  smallest  region 
containing  BRCAI  in  Pis  and  BACs. 

Initially,  several  unlinked  contigs  were  created  around  the  four 
starting  points  in  the  region;  D17S776  (13),  EDH17B2  (14), 
D17S855  (15),  and  RJNU2  (16).  The  D17S776  and  EDH17B2 
contias  were  oriented  and  connected  with  Pi -900  and  YAC 
714G3  (Fi<J.  2a).  This  proximal  contig  was  then  connected  to 
the  D 17S855  contig  by  YAC  652E10  (Fig.  2a).  The  RNU2  contig 
was  joined  to  our  previous  physical  map  (Miki  et  oL,  submitted) 
at  D17S78  through  PI  walking.  At  this  point,  there  were  two 
contigs,  a  proximal  contig  containing  D17S776,  EDH17B2,  and 
D17S855,  and  a  distal  contig  containing  RNU2  and  D17S858 
(15).  The  contig  was  completed  with  identification  of  BAC  283 
and  Pl-402  (Fig.  2a). 

As  the  contig  was  being  assembled,  walking  in  a  PI  library 
was  done  to  independently  isolate  regions  covered  by  YACs  and 
to  merge  contigs.  When  PI  screens  were  negative,  cosmid 
subclones  (from  YACs)  or  a  BAC  library  were  screened.  To  close 
the  PI  gaps  from  754SP6  to  1239SP6  and  from  1 141T7  to  402T7, 
BACs  694B12  and  283G2  were  isolated.  BAC  694  was  shown 
to  contain  both  of  these  STSs  (Fig.  1).  BAC  283  was  shown  to 
encompass  Pl-754,  Pl-975,  and  Pl-1141  and  included  part  of 
Pl-402  (Fig.  2a). 

A  total  of  25  YACs  were  identified  which  mapped  to  the 
reaion.  Of  these,  1 1  YACs  are  included  on  the  map  in  Figure 
2a”because  they;  1)  span  areas  where  there  is  no  odier  coverage; 
2)  link  smaller  contigs;  3)  contain  STSs  used  to  isolate  Pis;  or 
4)  contain  STRs  which  were  identified  by  screening  cosmid 
subclones  from  those  YACs.  Fifty-two  Pis  were  identified  which 
mapped  to  the  region,  of  which  29  are  included  on  the  map.  Only 
the  minimal  set  of  Pis  found  which  are  necessary  for  a  complete 
contia  and/or  contain  STRs  are  included  on  the  inap. 

To'aid  in  construction  of  the  physical  map  and  to  further  refine 
the  BRCAI  region,  polymorphic  STRs  were  developed  by 
sequencing  with  CA  dinucleotide  repeat  primers  or  by  direct 
capture  of  a  series  of  di-,  tri-,  and  tetranucleotide  repeats.  STRs 
D17S1320,  D17S1326,  D17S1327  (ED2;  10),  D17S1325,  and 
D17S1329  were  isolated  from  cosmid  subclones  and  STRs 
D17S1321,  D17S1328,  D17S1322,  D17S1323,  and  D17S1324 
were  from  Pis.  In  Table  1,  the  primer  sequences,  size  of  the 
product,  number  of  alleles  and  percent  heterozygosity  of  the  new 
STRs  are  presented. 

In  order  to  define  the  mirumal  region  containing  the  BRCAI 
aene,  we  tested  the  new  STRs  in  two  previously  defined 
recombinant  breakpoints.  For  defining  the  proximal  boundary , 
kindred  2082  contained  the  relevant  recombination  event.  Kindred 


2082  is  the  largest  S/?C4i-linked  breast/ovarian  cancer  family 
smdied  to  date.  It  has  a  lod  score  of  9.5  providing  unequivocal 
evidence  for  17q  linkage.  This  family  is  described  in  Goldgar 
et  al.  (10),  in  which  a  critical  recombinant  placing  BRCAI  distal 
to  D17S776  was  identified.  This  recombinant  occurred  in  a 
woman  diagnosed  with  ovarian  cancer  at  age  45  whose  mother 
had  ovarian  cancer  at  age  63.  The  affected  mother  was  deceased, 
but  could  be  inferred  from  her  children  to  have  the  linked 
haplotype,  which  was  also  present  in  the  36  other  linked  cases 
in  the  family,  in  the  region  between  D17S776  and  D17S579.  Her 
affected  daughter  received  the  linked  allele  at  D17S1328, 
D17S1327,  and  more  distal  loci,  but  received  the  allege  on  the 
non-BRCAl  chromosome  at  D17S776,  D17S1320  and 
D17S1321.  The  recombinant  described  above  therefore  placed 
BRCAI  distal  to  D17S1321. 

For  defining  the  distal  BRCAI  boundary,  kindred  1813  was 
analyzed.  Kindred  1813  is  a  small  family  with  four  cases  of  breast 
cancer  diagnosed  under  the  age  of  40  whose  mother  had  breast 
cancer  diagnosed  at  age  45  and  ovarian  cancer  at  age  6L  This 
family  yields  a  maximum  multipoint  lod  score  of  0.60  with  17q 
markers  and  given  that  there  is  at  least  one  case  of  ovman  cancer, 
results  in  a  posterior  probability  of  being  a  SRC47-linked  family 
of  0.94.  This  family  contains  a  directly  observable  recombination 
event  in  a  woman  who  developed  breast  cancer  at  age  34  which 
previously  indicated  that  BRCAI  was  proximal  to  D17S78  (11). 
The  genotype  of  her  affected  mother  at  the  relevant  17q  loci  was 
inferred.  The  affected  individual  inherited  the  5RC47-linked 
alleles  for  D17S250,  THRAl,  D17S800,  D17S855,  D17S1326, 
and  D17S1324.  For  markers  distal  to  D17S1324,  i.e.,  D17S1325. 
D17S1329,  D17S858,  and  D17S78,  she  inherited  the  alleles 
located  on  the  non-disease  bearing  chromosome.  The  evidence 
from  this  family  therefore  placed  the  BRCAI  locus  proximal  to 
the  marker  D17S1325. 

The  minimal  region  containing  BRCAI,  bounded  proximally 
by  D17S1321  and  distally  by  D17S1325,  was  contained  within 
a  PI  and  BAC  contig  constructed  based  on  STS  content  (Fig. 
1).  In  order  to  confirm  the  order  and  continuity  of  this  contig, 
restriction  fragment  fingerprinting  was  performed.  Both  ffi/JcUn 
and  EcoRI  digests  among  Pis  and  BACs  in  the  region  were 
compared  for  overlaps  among  the  clones.  The  EcoRI  restriction 
fragment  fingerprints  are  shown  in' Figure  3.  Further  evidence 
for°the  overlap  of  clones  came  from  observations  of  identical 
DNA  sequences  on  adjacent  Pis  (data  not  presented). 

With  refinement  of  the  physical  map,  the  Pis  and  BACs  were 
used  as  probes,  selectors,  and  templates  for  direct  screening  of 
cDNAs,  hybrid  selection  of  cDNAs,  and  random  sequencing, 
respectively  (7).  The  BRCAI  gene  (7)  is  located  on  BAC  283, 
totally  encompasses  Pl-754,  and  contains  parts  of  BAC  694, 
Pl-975  and  Pl-1141,  as  expected  due  to  the  overlapping  nature 
of  these  clones  (Fig.  2a).  STRs  D17S1323,  D17S855,  and 
D17S1322  are  located  in  introns  of  the  BRCAI  gene.  The  exact 
size  of  the  gene  is  not  yet  known,  but  is  estimated  to  be  100  kb. 

Other  identified  genes  were  localized  with  respect  to  our 
physical  map.  EDH17B2  (14)  was  used  as  one  of  the  starting 


Figure  1.  PI.  YAC,  BAC.  and  cosmid  clones  in  the  BRCAI  region.  The  clones.  STS  markers  and  J 

mat  are  present  in  a  clone  are  indicated  with  a  ^  and  those  that  are  absent  are  indicated  with  a  - 

the  clone.  YAC  origin  and  individual  plate  and  well  numbers  tor  the  Pis  and  BACs  are  listed  under  the  Address  co  u  . 
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Figure  2  Physical  maps  of  the  BRCAJ  region.  The  rwo  maos  are  drawn  to  scale  and  overlap  in  the  region  between  EDH17B2  and  PPY.  Cen  and  tel  denote  toward 
the  centromere  and  telomere,  respectively,  (a)  Physical  map  of  YACs,  Pis,  BACs,  and  cosmid  clones  from  D17S776  to  PPY.  (My  n^al  sets  of  PI  clones 
and  YACs  for  the  contig  are  shown.  The  refined  BRCAJ  region  is  contained  within  the  bold  line  of  the  map.  The  location  of  the  BRCAJ  gene  is  shown  m  me 
box  and  is  described  in  the  text.  Polymorphic  markers  and  senes  are  indicated  by  vertical  lines  on  the  map.  YAC  clones  are  drawn  above  the  map.  L  and  R  represent 
the  left  and  right  arms  of  the  YAC  vector  and  indicate  the  STS  developed  for  walking.  Dashed  lines  indicate  the  chimeric  portions  and  the  parenthesis  indicates 
a  deletion  within  the  YAC.  PI  clones,  BACs  283  and  694,  and  cosmid  CF4  are  drawn  below  the  map.  The  T7  primer  end  of  the  PI  clone  msen  is  mdicated  by 
an  arrow  on  the  solid  line.  The  other  end  of  the  PI  clone  insen  is  flanked  by  the  SP6  primer,  and  that  of  the  cosmid  insert  is  flanked  by  the  T3  pnmer  w^h 
are  shown  as  dots.  The  arrow  and  dot  indicate  that  a  STS  was  developed  for  walking,  (b)  Pulsed  field  gel  restriction  map  for  a  2  Mb  region  containing  EDHl^ 
and  D17S78.  Markers  are  localized  to  a  restriction  fragment:  exact  locations  are  unknown.  NT  and  NR  repr^ent  Notl  and  Nrul  sites  that  are  unmethylated  in  DNA 
from  MRC5  normal  human  fibroblasts  and  human  mammary  epithelial  cells.  Fibroblast  and  mammary  epithelial  cell  DNA  yielded  similar  results. 
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points  for  the  map.  PPY  (17)  was  localized  to  within  45  kb  of 
D17S78,  based  on  data  that  a  cosmid  contained  both  of  them 
(18).  Other  genes  mapped  by  STSs  were  Ki  antigen  (19),  human 
L21  (12),  MOXl  (20),  VHR  (21),  CA125  (22),  and  RNU2  (16). 
RNU2  is  a  tandem  array  of  genes  of  approximately  120  kb  (16) 
and  is  localized  on  the  map  based  on  a  STS  that  mapped  to  Pis 
405  and  931  (Fig.  1). 

The  size  of  the  region  from  D17S776  to  D17S78  is  estimated 
to  be  at  least  2.3  Mb  based  on  the  sum  of  size  of  YAC  714G3 
and  the  pulsed  field  map.  The  distance  from  D17S776  to 
EDH17B2  is  a  minimum  of  800  kb  based  on  the  size  of  YAC 
714G3  (12)  which  lies  between  them.  The  size  of  the  region  from 
EDH17B2  to  D17S78  is  approximately  1.5  Mb  based  on  a  pulsed 
field  map  constructed  from  Notl  and  Nrul  digests  (Fig.  2b).  For 
the  region  from  D17S1321  to  D17S1j25,  estimates  of  sizes  of 
the  Pis  and  BACs  and  degree  of  overlap  among  clones  were 
computed  based  on  restriction  fragment  fingerprints  (Fig.  3,  Table 
2).  The  distance  from  D17S1321  to  D17S1325  is  estimated  to 
be  no  more  than  600  kb.  This  estimate  is  consistent  with  the 
pulsed  field  mapping  results. 

DISCUSSION 

This  report  presents  a  physical  map  of  the  minimal  BRCAl  region 
completely  subcloned  into  bacterial  vectors.  It  includes  all  the 


Figure  3.  Restricdon  fragments  generated  with  fcoRJ  from  Pis  and  BACs  fjetween 
DI7S1321  and  D17S1325.  Not  ail  the  Pis  shown  here  were  included  in  Figure  2a. 


Pis,  BACs,  and  STSs  comprising  the  minimal  tiling  path.  Nine 
polymorphic  STRs  were  identified  which  were  used  to  refine  the 
region  genetically .  These  nine  STRs  and  the  5 1  end-clone  STSs 
were  used  to  screen  for  Pis  and  to  better  determine  PI  and  BAC 
overlaps.  We  determined  that  the  size  of  the  region  between 
D17S1321  and  D17S1325  was  less  than  600  kb,  within  which 
lies  the  recently  isolated  BRCAl  gene  (7). 

Contig  assembly  began  with  YACs  and  proceeded  to  Pis, 
BACs,  and  cosmid  subclones  which  are  more  convenient  reagents 
for  gene  isolation  and  are  less  prone  to  deletion  and  chimerism. 
Albertsen  etal.  (12),  during  construction  of  thtii  BRCAl  physical 
map,  found  that  several  parts  of  the  map  were  not  represented 
by  YACs  due  to  small  deletions  or  rearrangements.  They  were 
able  to  overcome  most  of  the  problems  identified  by  isolating 
Pis.  We  also  found  that  many  of  the  YACs  were  chimeric  or 
contained  deletions.  Of  the  YACs  in  Figure  2a,  the  right  arms 
of  26F3  and  652E10  were  chimeric,  303G8  contained  an  internal 
deletion,  and  the  distal  portion  of  641A11  was  chimeric.  Other 
YACs  such  as  712A7  and  693E8  were  not  included  on  the  map 
due  to  a  three-part  chimera  where  neither  arm  was  on 
chromosome  17,  nor  was  851H11  included  due  to  a  large 
deletion.  Pis  and  BACs  are  reported  to  be  more  stable  and  less 
prone  to  deletion  and  chimerism  (23,24).  In  addition,  it  is 
relatively  easy  to  isolate  microgram  quantities  of  insert  DNA  from 
BACs  and  Pis.  For  the  600  kb  map  of  the  refined  BRCAl  region 
reported  here,  there  is  complete  coverage  with  Pis  and  BACs, 

with  no  known  gaps  of  DNA. 

The  size  of  the  region  from  D17S776  to  D17S78  is  estunated 
to  be  2.3  Mb,  with  the  distance  from  EDH17B2  to  PPY  estimated 
to  be  1.5  Mb  based  on  long-range  pulsed  field  restriction 
mapping.  Within  the  refined  BRCAl  region  defined  by  D17S1321 
and  D17S1325,  the  distance  is  estimated  to  be  approximately  600 
kb  based  on  restriction  fragments  of  the  Pis  and  BACs,  which 
is  consistent  with  the  pulsed  field  gel  results.  Slight  differences 
between  the  two  maps  are  likely  due  to  experimental  variance 
in  measurements.  In  the  pulsed  field  analysis,  we  only  can 
determine  the  fragment  on  which  a  particular  marker  is  located. 
The  exact  location  of  markers  on  large  restriction  fragments 
cannot  be  ascertained.  For  the  size  estimate  based  on  £coRI  and 
w/nHTTT  restriction  fragments,  distances  are  imprecise  for 
insert/vector  junction  fragments  where  the  two  Pis  overlap,  as 
they  will  not  share  a  common  restriction  fragment.  In  addition. 


Table  2.  Estimated  sizes  for  Pis  and  BACs  between  D17S1321  and  D17S1325 


Gone 


Size  (Kb) 


PM309 

PM308 

PM238 

Pl-1239 

BAC  694 

BAC  283 

Pl-754 

Pl-975 

PM141 

Pl-402 

PM046 

P1^5 

Pl-931 

Pl-384 

P1417 


70 

85 

70 

65 

no 

130 

40 
83 
60 
76 

41 
35 
43 
70 
78 
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we  did  not  investigate  the  RNU2  region  in  detail,  and  there  may 
be  small  deletions  of  the  Pis  within  the  RNU2  gene  duster  based 
on  the  smaller  insert  sizes  of  Pl-405  and  Pl-931  which  are  35 
and  43  kb  respectively  versus  an  average  PI  size  of  80  kb  (25). 

These  clones  facilitated  systematic  screening  of  cDNA  libraries 
for  transcribed  sequences,  followed  by  mutation  screening  of 
candidate  genes  in  BRCAl  carriers.  Although  the  purpose  for 
which  the  map  was  originally  constructed  (i.e.,  isolation  of 
BRCAl)  has  now  been  completed,  the  physical  map  and  STR 
information  presented  in  this  manuscript  are  still  of  value  for 
researchers  studying  the  breast  cancer  gene  and  other  genes  in 
the  region.  Over  the  past  several  years,  the  BRCAl  region  has 
been  intensively  studied  by  many  research  groups.  With  a 
collaborative  effort  to  compile  this  wealth  of  physical  mapping 
and  DNA  sequence  data,  it  should  be  possible  to  investigate  the 
comolete  eenomic  structure  of  this  sene-rich  segment  of  human 
DNA. 

MATERIALS  AND  METHODS 

Vectors 

iMCs,  The  YAC  libraries  used  were  from  the  Centre  d’Etude  du  Polymorphisme 
Humain  (CEPH  YAC  library)  (26)  and  Washington  University  (St.  Louis  YAC 
library')  (27).  YACs  used  in  the  construction  of  the  map  are  listed  in  Figure  1. 
Positive  screenings  of  YAC  26F3  with  EDH17B2  and  YACs  B196A12  and 
A167E6  with  RNU2  were  reported  by  F. Collins  at  a  Chromosome  17  Workshop 

(28) .  YAC  303GS  was  positive  with  EDHI7B2  as  reported  to  GDB  by  S.Thomas 

(29) . 

For  screening  libraries,  PCR  reactions  were  carried  out  in  a  10  /d  volume, 
containing  1  ng  p>ooled  YAC  DNAs,  0.5  piM  each  primer,  200  /iM  dNTP ,  10 
iriM  Tris-HCl,  50  mM  KCl,  1.5  mM  MgQj,  0.01%  gelatin,  and  0.25  U  of 
Taq  polymerase  in  a  Thermal  Cycler  (GeneAmp  PCR  System  9600,  Perkin-Elmer 
Cetus)  for  35  cycles  of  94°C  for  10  s,  55°C  for  10  s,  12°C  for  10  s.  Products 
were  visualized  on  1.2%  agarose  gels  stained  with  ethidium  bromide, 

YAC  vector-/! /u  PCR  was  performed  to  isolate  YAC  end  sequences.  Primers 
used  for  PCR  were  mtcr-Alu  primers  IIL,  2IL,  2IR,  and  2MR  and  PFYL  for 
the  left  arm  of  the  YAC  vector  and  PFYR  for  the  right  arm  of  the  YAC  vector 
(Table  3).  PCR  conditions  were  the  same  as  described  above.  The  PCR  products 
from  the  primer  pair  of  Alu  primer  and  YAC  vector  primer  were  compared  to 
products  using  o^y  Alu  primer.  Unique  Alu-vecior  products  were  isolated  and 
sequenced.  When  isolation  of  YAC  end  sequences  by  YAC  vcctor-Alu  PCR  was 
unsuccessful,  the  same  procedure  was  done  using  a  set  of  12  random  primers 
instead  of  the  Alu  primers.  The  PCR  products  firom  the  primer  pair  of  a  random 
primer  and  YAC  veaor  primer  were  compared  to  products  using  only  the  random 


primer.  Unique  random  primer-vector  products  were  sequenced.  This  allowed 
us  to  sequence  the  ends  of  ail  the  YACs.  Ail  end-YAC  STSs  were  checked  against 
somatic  cell  hybrids.  ME3-5  and  ME3-7  (Cerosaletti  et  ai,  submitted)  to  ensure 
that  they  werc  on  17ql2-q21.  DNA  sequencing  was  done  by  the  chain  termination 
method  (30)  using  Sequenasc  (USB,  aeveland,  OH)  or  by  cycle  sequencing  using 
a  Cyclist™  Taq  DNA  Sequencing  Kit  (Stratagene,  CA). 

Pis.  The  PI  library  (DMPC-HFF  #  1 ;  25)  was  screened  by  a  PCR-based  strategy 
and  positive  clones  were  isolated  by  GENOME  SYSTEMS,  INC.  (Missouri). 
Pis  and  BACs  reported  here  are  available  for  research  purposes  at  no  cost  upon 
request  to  the  corresponding  author.  PI  DNAs  were  prepared  using  an  alkaline 
lysis  plasmid  isolation  protocol  and  purified  by  equilibrium  centrifugation  in 
CsCl-“ ethidium  bromide  gradients.  PI  inserts  were  sequenced  with  T7  and  SP6 
as  described  above  for  YAC  sequencing.  End-specific  STSs  were  used  for  the 
next  screening  of  the  PI  library. 

BACs.  BAC  screening  was  by  a  hybridization-based  strategy  with  STS  754SP6. 
A  radioactive  probe  was  annealed  with  total  human  DNA  before  adding  to  the 
hybridization  buffer.  Forty-two  fillers  containing  96,000  BAC  clones  were 
hybridized  and  washed  twice  with  2xSSC~0.1  %  SDS  at  47‘’C.  Positive  clones 
were  identified  and  confirmed  by  PCR  using  the  754SP6  STS. 

Cosmids.  Cosmid  libraries  were  constructed  from  each  of  the  YACs.  Yeast  strains 
containing  YACs  were  grown  in  AHC  medium  (1.7  g/1  yeast  nitrogen  base  (Difco), 
5  gA  ammonium  sulfate,  10  g/1  casein  hydrolysate.  20  gA  glucose,  20  mgA  adenine 
hemisulfaie:  pH  5.8  (27)).  High  molecular  weight  DNAs  were  prepared  from 
YAC-containing  yeast  strains  by  centriftigarion  through  a  sucrose-step  gradient 
using  the  methodology'  of  Carle  and  Olson  (31).  Cosmid  libraries  were  constructed 
from  total  yeast  DNAs  as  described  by  Tokino  (32).  The  resultant  cosmid  libraries 
were  plated  at  a  density  of  15-20  clones/cm-,  and  clones  containing  human 
DNA  were  identified  by  colony  hybridization  using  human  Cot-1  DNA  (Life 
Technologies,  Inc.  MD)  as  a  probe.  Selected  cosmids  were  sequenced  with  T3 
and  T7  primers  (Table  3)  to  make  end-specific  STSs  of  cosmid  inserts. 

Genotyping  methods 

Genotyping  of  family  members  for  the  markers  described  were  carried  out  at 
the  University  of  Utah  and  McGill  University  for  kindreds  2082  and  1813, 
respectively.  In  both  locations,  similar,  standard  PCR  protocols  and  conditions 
were  used  with  the  exception  that  ^^S-radiolabelling  was  used  at  McGill,  while 
^^P-labelling  was  performed  at  Utah.  All  autoradiographs  at  both  locations  were 
read  by  two"^ independent  observers  and  all  events  critical  to  recombinant  analysis 
were  repeated  to  verify  the  initial  results. 

Identification  of  STRs 

577?  idenrification  by  sequencing.  To  identify  CA  repeats  in  the  contig,  cosmids 
were  screened  by  hybridization  to  a  radiolabelled  (CA)io  oligonucleotide  probe. 
Both  positive  PI  and  cosmid  clones  containing  human  inserts  were  sequenced 
directly  with  two  sets  of  three  mixed  primers:  CAl  and  CA2  (Table  3).  The  first 


Table  3.  Primers  and  sequences 


Name 

Primer  Sequences 

Description 

IIL 

GGG  CTG  CAG  CAT  GTT  GGC  CAG  GCT  GGT  CT 

Inter-/! /w 

2IL 

GGA  GTG  CAG  TGG  CGC  GAT  CTC  TGC  AGA  CTG 

Inicr-Alu 

2IR 

CAG  TCT  GCA  GAG  ATC  GCG  CCA  CGC  ACT  GCA  GCC  TGG 

Inter-z4/u 

2MR 

GCA  CTG  CAG  CCT  GGG  CAA  CAG  AGC  GAG 

Imer-Alu 

PFYL 

CAA  GTT  GGT  TTA  AGG  CGC  AAG 

YAC  left  arm 

PFYR 

TCG  AAC  GCC  CGA  TCT  CAA  G 

YAC  right  arm 

YL 

TGG  AAG  GAC  CGG  ATA  ATT  ATT  TGG  AAT  CTC 

YAC  left  arm 

YR 

GTA  GAG  CTT  GAA  AGA  AAT  ATT  TCA  ACT  TGC 

YAC  right  arm 

T3 

ATT  AAC  CCT  CAC  TAA  AG 

Cosmid 

T7 

AAT  ACG  ACT  CAC  TAT  AG 

Cosmid,  PI,  BAC 

SP6 

GAT  TTA  GGT  GAC  ACT  ATA  G 

PI,  BAC 

CAl 

CA2 

RP-2 

RL.blunt 

RL.S3A 

RP-B 

ACA  CAC  ACA  CAC  ACA  CAC  N  (N  is  T  or  C  or  G) 

CAC  ACA  CAC  ACA  CAC  ACA  N 

TGA  GTA  GAA  TTC  TAA  CGG  CCG  TCA  TTG  TTC 

(PO4)  GAA  CAA  TGA  CGG  CCG  TTA  GAA  TTC  TAC  TCA  (NHj) 

(PO4)  GAT  CGA  ACA  ATG  ACG  GCC  GTT  AGA  ATT  CTA  CTC  A 
(PO4)  TGA  GTA  GAA  TTC  TAA  CGG  CCG  TCA  TTG 

Oligonucleotide  sequences  for  the  primers  are  listed  in  the  5'— 3'  direction. 

RP-2.blunt  is  RP-2  annealed  with  RL.blunt.  RP-2.S3A  is  RP-2  annealed  with  RL.S3A. 
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STR  primer  for  each  marker  was  designed  from  this  sequence  and  used  to  sequence 
back  through  the  CA  repeat.  The  second  primer  was  designed  so  that  the  marker 
would  generate  a  PCR  product  of  approximately  150  bp.  Both  primers  were 
designed  so  that  the  annealing  temperature  was  55  "*0.  This  primer  pair  was  then 
.creened  against  chromosome  17  somatic  cell  hybrids  to  ensure  that  it  mapped 
0  chromosome  17. 

STR  cloning  by  direct  capture.  For  each  PU  1-5  /tg  DNA  was  digested 
independentlv  with  5^«3AI  and  Haem.  0.5  /ig  of  each  digest  was  examined  on 
an  agarose  gel;  the  remaining  DNA  was  phenol -chloroform  extracted,  isopropanol 
precipitated,  and  redissolved  at  a  concentration  of  50  ngZ/rl,  The  double  stranded 
^chor  adapter  R?-2.blunt  (Table  3)  was  ligated  to  the  Haem  digested  P 1  DNA. 
Similarly,  the  double  stranded  anchor  adapter  RP-2.S3A  (Table  3)  was  ligated 
to  the  5^r^IIIA  digested  PI  DNA.  The  two  ligation  reactions  from  each  PI  were 
combined  and  fractionated  over  a  sepharose  CL-6B  column  to  remove  excess 
anchor  adapters.  Approximately  25  ng  of  anchored  PI  DNA  was  hybridized  in 
50  ij\  of  20  mM  Hepes  pH  7.5,  1  mM  EDTA,  3  M  tetramethylammoniura  chionde, 
;br  60  min  at  50^0  with  mixmres  of  23mer  biotinylated  microsatellite  repeat 
oligonucleotides  (each  oligo  at  0.1  nM)  (e.g.  biotin-CA:  B'ra  CAC  ACA  CAC 
AC  A  CAC  ACA  CA  where  B  =  biodn  amidite).  Oligos  tor  CA,  GA,  AAC, 
AAG.  AAT,  ACC,  AGC,  AGG,  TCA,  CGC,  AAAC  ,  AAAG,  AAAT,  AAGG, 
and  AGGG  were  used  in  this  study.  PI  DNA  fragments  hybndized  to  the 
biotinylated  oligos  were  captured  on  streptavidin  paramagnetic  particles  (Dyna), 
resuspended  in  100  id  of  20  mM  Hepes  pH  7.5,  1  mM  EDTA,  3  M 
tetiamethylammonium  chloride,  and  incubated  for  10  min  at  55^C.  The  bioonylat^ 
oli<»os  and  hybridized  PI  DNAs  were  then  captured  and  washed  twice  with  100 
al  of  10  mM  Tris  pH  8.2,  50  mM  KQ,  O.l  mM  EDTA,  and  resuspended  m 
10  /tl  of  5  mM  Tris  pH  8.2,  0.1  mM  EDTA.  1.5  ^il  aUquois  of  the  resuspended 
beads  and  captured  DNAs  were  PCR  ampUfied  using  the  primer  RP-B,  The  PCR 
reactions  were  run  out  on  agarose  gels.  Microsatellite  repeat  bearing  bands  were 
cut  out  of  the  gels,  purified  on  Qiaex  beads  (Qiagen),  ligated  into  pUC  18,  cloned, 
and  sequenced. 

Restriction  mapping  of  PI  and  BAC  clones 

DNAs  were  isolated  from  a  1.5  mi  overnight  culture  of  each  strain  (33).  The 
DNA  was  digested  with  £coRI  and  HindSl  separately,  labelled  with  P-dATP 
by  filling-in  with  the  Klenow  fragment  oiEcoli  DNA  polymerase  I,  and  separated 
in  a  0.6%  agarose  gel  by  electrophoresis.  The  gel  was  then  dried  onto  DE81 
filter  paper  under  vacuum  and  exposed  to  Kodak  X-ray  film.  The  computer 
program  GEL  by  J.-M.Lacroix  was  used  to  assign  fragment  sizes.  Size  was 
determined  bv  summing  the  restriction  fragments,  excluding  the  vector  fragments. 
Overlap  between  Pis  and  BACs  was  calculated  by  summing  the  restncaon 
fragments  in  common. 


Pulsed  field  gel  restriction  mapping 

MRC5  normal  human  fibroblasts  and  human  mammary  epithelial  cells  were 
embedded  in  I  %  low  melt  agarose  at  a  concentration  of  2.5  x  10  cells/nd  using 
gel  syringes  (NEB).  Hardened  agarose  rods  were  extruded  into  10  ml  of  digestion 
Solution  consisting  of  1.0  mg/mi  proteinase  K  (Boehringer),  100  mM  EDTA, 
and  1  %  zV-laurylsarcosine  (Sigma).  Rods  were  incubated  at  50° C  for  48  h  at 
which  time  they  were  chilled  on  ice  and  dialyzed  three  times  in  50  mM  EDTA 
in  a  volume  of  50  ml/wash.  Plugs  were  stored  in  the  gel  syringes  at  4°C.  Twenty 
microliter  slices  of  the  embedded  DNA  were  cut  from  the  syringes  and  dialyzed 
in  TE  for  1  h  on  ice.  This  solution  was  removed  and  replaced  with  200  id  of 
the  appropriate  1  x  restriction  buffer  and  dialyzed  on  ice  for  30  min.  This  solution 
was  replaced  with  1 X  restriction  buffer  containing  40  to  100  units  of  Notl  or 
Nrui  and  further  incubated  for  30  min  on  ice.  Digestions  were  at  37  C  for  6 
h.  The  restriction  buffer  was  removed  and  replaced  with  I  mi  of  ice  cold  gel 
running  buffer  (0.5XTBE). 

Gels  were  run  on  either  a  Pulsphor  (LKB)  or  CHEF  DRH  unit  (BioRad). 
Agarose  gels  (1  %)  in  0.5  xTBE  were  run  in  0.5  xTBE  at  a  lemperamre  of  14°C. 
Running  conditions  varied  with  fragment  size  being  resolved,  but  consisted  of 
continuously  ramped  pulses  running  at  200  V.  Markers  consisting  of  yeast 
chromosomes  or  lambda  DNA  were  purchased  from  NEB.  Gels  were  stained 
in  0.5  Mg/ml  EtBr  for  30  min  and  destained  in  distilled  water  for  30  min.  Gels 
were  photographed  and  nicked  in  a  UV  Stratalinker  2400  (Stratagen®!)  on  the  auto¬ 
crosslink  setting.  Gels  were  then  placed  in  alkali  (0.5  M  NaOH,  1.5  M  NaCl) 
for  30  min,  transferred  via  capillary  blotting  onto  GeneScreen  Plus  (DuPont)  in 
the  same  solution  for  24  h,  and  neutralized  in  0.5  M  Tris/3.0  M  NaCl.  Filters 
were  hybridized  to  radiolabelled  probes  in  Hybrisol  II  buffer  (Oncor),  washed 
to  2xSSC/l%SDS  at  65°C,  and  exposed  to  film  at  -70°C  for  1-4  days. 
EDH17B2.  RNU2,  and  PPY  probes  were  generated  by  PCR  with  primers 
described7n  the  Chromosome  17  Workshop  Report  (34).  The  remaining  probes 
used  to  construct  this  map  were  cDNAs  isolated  by  solution  hybrid  capture  (20) 


during  the  course  of  generating  a  transcript  map  of  the  BRCAl  region  (P.A.Futreal 
et  al.,  in  preparation).  The  map  (Fig.  2b)  was  constructed  by  aligning  films  of 
consecutive  probings  and  measuring  fragment  sizes. 

Electronic  information  access 

DNA  primer  sequences  and  sizes  of  the  products  are  available  from  GDB.  This 
information  may  also  be  obtained  directly  from  the  authors  via  anonymous  ftp 
to  morgan.med.  Utah,  edu  (128.110.62,50)  in  the  directory  pub/STS. 
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A  detailed  transcription  map  of  the  320-kb  region 
containing  the  HSD17B  locus  on  chromosome  17  was 
generated.  Thirty  unique  cDNA  fragments,  retrieved 
following  the  hybridization  of  immobilized  YACs  to 
primary  pools  of  cDNAs  prepared  from  RNA  of  mam¬ 
mary  gland,  ovary,  placenta,  and  the  Caco-2  cell  line, 
were  aligned  into  10  transcription  units  by  physical 
mapping  and  hybridization  to  RNAs  of  a  series  of  tis¬ 
sues.  The  cDNAs  were  then  further  characterized  by 
sequencing  and  used  to  screen  mammary  gland  cDNA 
libraries.  Fragments  corresponding  to  the  broadly  ex¬ 
pressed  y-tubulin  and  Ed  antigen  genes  were  identi¬ 
fied.  A  full-length  cDNA  clone  encoding  a  117-amino- 
acid  protein  homologous  to  the  rat  ribosomal  protein 
L34  was  isolated.  Portions  of  genes  with  restricted  pat¬ 
terns  of  expression  were  also  obtained,  including  the 
previously  characterized  jHrSD17Bi.  One  new  gene,  for 
which  a  full-length  cDNA  was  isolated,  was  found  to 
have  an  interesting  tissue-specific  pattern  of  expres¬ 
sion  with  abundant  mRNA  in  both  the  colon  and  the 
testis  and  in  the  mammary  carcinoma  cell  line  BT-474. 
This  contrasted  with  the  barely  detectable  level  ob¬ 
served  in  several  tissues  including  normal  mammary 
gland.  Of  the  five  additional  transcription  units  identi¬ 
fied,  one  showed  no  similarity,  two  showed  identity  to 
human  expressed  sequences,  and  two  displayed  simi¬ 
larity  to  genes  of  animal  species  by  amino  acid  align¬ 
ment.  These  latter  cDNA  clones  include  potential  ho- 
mologues  of  a  rat  nuclear  tyrosine  phosphatase  and  of 
a  factor  of  Drosophila  that  is  known  to  be  involved 

Sequence  data  from  this  article  have  been  deposited  with  the 
EMBL/GenBank  Data  Libraries  under  Accession  Nos.  L38932~941 
and  L39000. 
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INTRODUCTION 

The  refinement  of  the  breast  cancer  susceptibility 
locus  within  17ql2-q21  by  genetic  linkage  (Hall  et  aL, 
1990;  Narode^  aZ.,  1991;  Easton  eZaZ.,  1993)  and  recom¬ 
bination  mapping  studies  (Anderson  et  aL,  1993;  Bow- 
cock  et  aL,  1993;  Chamberlain  et  aL,  1993;  Kelsell  et 
aL,  1993;  Simard  et  aL,  1993;  Albertson  et  aL,  1994a; 
Goldgar  et  aL,  1994;  Smith  et  aL,  1994;  Tonin  et  aL, 
1994)  has  recently  led  to  the  identification  of  the 
BRCAl  gene  at  D17S855  (Miki  et  aL,  1994;  Futreal  et 
aL,  1994a).  Germline  mutations  have  been  detected  in 
linked  families  in  this  putative  transcription  factor 
(Futreal  et  aL,  1994a;  Castilla  et  aL,  1994;  Simard  et 
aL,  1994;  Friedman  et  aL,  1994a).  Noted  features  of 
these  families  include  premenopausal  onset  of  breast 
cancer  and  high  incidences  of  ovarian  cancer.  In  spo¬ 
radic  cases,  the  involvement  of  additional  genes  on  the 
long  arm  is  suspected  (Vogelstein  and  Kinzler,  1994), 
as  studies  performed  directly  on  breast  tumors  indicate 
that  a  range  of  chromosome  17  abnormalities  and  re¬ 
arrangements  occur  in  a  significant  proportion  of  pa¬ 
tients  (Sato  et  aL,  1991;  Kallioniemi  et  clL,  1994).  This 
has  been  supported  by  the  high  frequency  of  loss  of 
heterozygosity  in  the  BRCAl  region  (Eccles  et  aL,  1990; 
Futreal  et  aL,  1992;  Smith  et  aL,  1992;  Saito  et  aL, 
1993;  Cropp  et  aL,  1994;  Neuhausen  and  Marshall, 
1994).  These  changes  could  involve  genes  that  are  not 
necessarily  causal,  but  may  be  involved  in  the  progres¬ 
sion  of  breast  cancers.  Additional  and  refined  mapping 
of  17q  and  delineation  of  the  minimal  intervals  where 
alterations  occur  are  needed. 

High-density  maps  of  physical  markers  and  a  num¬ 
ber  of  candidate  genes  have  been  obtained  from  17ql2- 
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q21  (Albertson  et  al,  1994b;  Campbell  et  al,,  1994;  Fu- 
treai  et  al,  1994b;  Kamb  et  al,  1994;  Neuhausen  et  al, 
1994).  A  detailed  transcription  map  of  the  region  just 
telomeric  oiBRCAl  (Jones  et  al,  1994)  and  preliminary 
data  from  a  broader  map  have  recently  been  reported 
{Friedman  et  al,  1994b).  The  HSD17B  locus  is  centro- 
meric  of  BRCAl  and  was  known  to  include  at  least  two 
genes,  one  of  which  is  likely  to  be  a  processed  pseu¬ 
dogene  (Luu-The  et  al,  1989,  1990).  The  active  gene, 
HSD17B1  (formerly  known  as  EDH17B2),  catalyzes 
the  interconversion  of  the  weak  estrogen,  estrone,  to 
the  more  potent  17/3  estradiol.  Its  known  biological  role 
in  the  regulation  of  estrogen,  the  physical  location,  and 
the  expression  pattern  led  initially  to  consideration  of 
this  gene  as  a  BRCAl  candidate;  however,  affected  in¬ 
dividuals  from  families  that  showed  chromosome  17 
■mkage  to  breast  cancer  did  not  reveal  sequence  alter¬ 
ations  (Simard  et  al,  1993;  Kelsell  et  al,  1993). 

We  sought  to  develop  a  detailed  transcription  map 
with  YACs  from  the  locus.  The  procedures  used  to  iden¬ 
tify  cDNAs  involved  hybridization  of  mixtures  of  cDNA 
libraries  to  immobilized  genomic  DNA  to  permit  the 
rapid  identification  of  genes  by  a  modified  direct  selec¬ 
tion  procedure  (Lovett  et  al,  1991;  Parimoo  et  al, 
1991).  A  minimum  of  10  transcription  units  were  iden¬ 
tified,  only  5  of  which  had  been  previously  identified 
as  human  genes.  All  of  these  units  are  confined  to  a 
320-kb  region  of  genomic  DNA  positioned  proximal  to 
D17S855  {BRCAl)  and  distal  to  D17S776. 

MATERIALS  AND  METHODS 

Preparation  and  isolation  of  genomic  DNAs.  HSD  1  and  HSD2 
YAC  clones  were  isolated  from  pools  of  CEPH  YACs  by  application 
of  the  polymerase  chain  reaction  (PCR)  to  3deld  a  product  of  236  bp 
with  the  oligonucleotides  EDI  (5'  TTGGCCGTACGTCTGGCTTCA) 
and  ED2  (5'  GTCCAGCTGCAACGTCTCCAG)  o(HSD17Bl.  Positive 
clones  were  verified  by  hybridization  with  the  cDNAs,  hpE2DH216 
and  hpEsDHSe  (Luu-The  et  al,  1989,  1990).  Yeast  cultures  were 
propagated,  and  rapid  genomic  (Scherer  and  Tsui,  1991)  and  total 
chromosome  DNA  was  isolated  as  described  (Rommens  et  al,  1994), 
Separated  artificial  chromosomes  were  obtained  by  electrophoresis 
with  the  CHEF-DR  II  pulsed-field  electrophoresis  system  (Bio-Rad 
Labs)  under  standard  conditions.  They  were  transferred  to  Hybond- 
N  nylon  membrane  (Amersham)  and  immobilized  by  UV  cross  linking 
for  the  selection  experiments  (Rommens  et  al,  1994). 

Genomic  DNAs  were  isolated  from  human  leukocytes  or  from  ro¬ 
dent  and  rodent  hybrid  cell  lines  by  established  procedures  (Miller 
et  al,  1988).  The  mouse-human  hybrid  cell  line  GM10498A  was 
obtained  from  the  NIGMS  Human  Genetic  Mutant  Cell  Repository, 
Camden,  New  Jersey.  Restriction  digestions  were  carried  out  ac¬ 
cording  to  the  enzyme  supplier’s  recommendations.  Electrophoresis, 
blotting,  and  hybridization  with  Hybond-N  membrane  (Amersham) 
were  carried  out  with  standard  procedures  (Sambrook  et  al,  1989). 
The  blots  were  washed  with  the  stringency  achieved  with  0.2  X  SSC 
with  0.1%  SDS  at  60°C.  The  blots  were  exposed  to  X-Omat  film  (Ko¬ 
dak)  for  2-72  h  at  -70‘^C  with  one  intensifying  screen. 

For  long-range  physical  mapping,  high-molecular- weight  DNA  em¬ 
bedded  in  agarose  of  the  cell  line  GM10498A  or  of  whole  yeast  chro¬ 
mosomes  was  digested  with  rare-cutting  restriction  enzymes.  Prod¬ 
ucts  were  separated  by  pulsed-field  gel  electrophoresis  and  trans¬ 
ferred  to  nylon  membrane  for  hybridizations  as  previously  described 
(Church  and  Gilbert,  1984;  Rommens  et  al,  1989). 


cDNA  preparation  and  selection.  Randomly  primed  cDNA  was 
prepared  from  poly(A)'^  RNA  of  mammary  gland,  ovary,  and  placenta 
tissues  and  from  total  RNA  of  the  Caco-2  cell  line  (ATCC  HTB  37), 
and  selection  was  carried  out  for  two  consecutive  rounds  of  hybridiza¬ 
tion  to  immobilized  artificial  chromosome  DNA  of  HSDl,  HSD2,  and 
HSD3  as  described  previously  (Rommens  et  al,  1993,  1994).  Hy¬ 
bridizing  cDNA  was  collected,  passed  over  a  G50  Fine  Sephadex 
column,  and  amplified.  The  products  were  then  digested  with  EcoRl, 
size  selected  on  agarose  gels,  and  ligated  into  pBluescript  (Stra- 
tagene)  that  had  been  digested  with  EcoRl  and  treated  with  calf 
alkaline  phosphatase  (Boehringer  Mannheim).  Ligation  products 
were  transformed  into  competent  DH5a  Escherichia  coli  cells  (Life 
Technologies,  Inc.), 

Characterization  of  retrieved  cDNAs.  Individual  colonies  from 
each  ligation  were  picked  and  gridded  onto  an  agar  plate  containing 
100  pg  ampicillin  for  ordering,  prescreening,  and  storage.  Grown 
colonies  were  transferred  onto  Biotrans  membranes  (ICN)  and  hy¬ 
bridized  with  radiolabeled  cDNA  and  HSD17B1  cDNA  segments. 
Plasmids  from  20  to  40  clones  from  each  selection  experiment  that 
did  not  hybridize  in  the  prescreenings  were  isolated  for  further  analy¬ 
sis.  Of  clones  tested,  approximately  85%  mapped  appropriately  to 
the  starting  YAC  clones.  The  membranes  of  the  grids  were  also  re¬ 
peatedly  hybridized  with  individual  cDNA  clones  to  determine  over¬ 
lapping  or  redundant  clones. 

RNA  preparation  and  hybridization  analysis.  Total  RNA  was  iso¬ 
lated  by  centrifugation  through  a  CsCl  cushion  following  tissue  dis¬ 
ruption  in  a  Polytron  homogenizer  or  resuspension  of  pelleted  cells 
from  cultured  lines  in  guanidinium  isothiocyanate  (MacDonald  et  al, 
1987).  Poiy(A)'"  RNA  or  blots  containing  poiy(A)‘"  RNA  were  pur¬ 
chased  and  used  according  to  the  supplier’s  recommendation  (Clon- 
tech  Labs).  For  hybridization  analysis,  RNA  was  fractionated  on 
agarose  gels  (1%)  containing  0.6  M  formaldehyde  and  transferred  to 
Hybond-N  membrane  (Amersham).  Following  cross-linking  with  UV 
radiation,  blots  were  repeatedly  hybridized  with  cDNA  fragments 
radiolabeled  by  random  priming  (Feinberg  and  Vogelstein,  1983). 
Final  conditions  for  washing  were  as  for  genomic  blots.  Blots  were 
exposed  to  X-Omat  AR  film  for  1-12  days  with  a  single  intensifying 
screen  at  -70°C.  Probes  were  removed  between  hybridizations  by 
heating  membranes  to  95°C  in  0.02X  SSC  and  0.01%  SDS. 

cDNA  library  screening.  To  provide  a  preliminary  screen,  PCR 
was  performed  using  specific  oligonucleotides  derived  from  the  se¬ 
quences  of  each  of  the  retrieved  cDNA  clones  (individual  clones  were 
named  GT  (number))  with  DNA  from  four  \gtll  cDNA  libraries 
(Clontech  Labs)  prepared  from  human  mammary  gland  (HL  1037b), 
human  breast  (HL  1061b),  ZR-75-1  human  breast  cancer  cells  (HL 
1059b),  and  human  ovary  (HL  1098b).  Subsequently,  13  cDNA  frag¬ 
ments  were  radiolabeled  and  used  to  screen  0.5— 2.0  X  10®  pfu  as 
indicated  in  Table  1.  Prehybridization  and  hybridization  were  per¬ 
formed  as  followed  at  42°C  in  50%  formamide,  5x  SSPE,  0.1%  SDS, 
5x  Denhardt’s  mixture,  0.2  mg/ml  denatured  salmon  testis  DNA, 
and  0.002  mg/ml  poly(A).  Dextran  sulfate  (4%,  v/v)  was  included  in 
the  hybridization  solution.  Membranes  were  washed  under  the  same 
conditions  as  those  for  the  genomic  blots  and  exposed  overnight  at 
— 80°C.  The  positive  phages  were  retested  for  a  second  and  third 
screening,  if  necessary,  to  obtain  purified  plaques  for  subcloning  into 
pBluescript  SK  (Stratagene)  vector.  PCR  was  applied  to  the  positive 
clones  with  the  universal  oligonucleotides  of  the  Xgtll  vector  and 
with  the  oligonucleotides  of  the  GT  clones  to  determine  insert  size. 

DNA  sequencing  and  sequence  analysis.  The  sequences  of  both 
strands  of  the  retrieved  cDNAs  were  obtained  wnth  the  dideoxy  chain 
termination  method  (Sanger  et  al,  1977)  using  the  T7  sequencing 
kit  (Pharmacia  Biotech  Inc.).  Initial  sequence  analysis  including  as¬ 
sembly,  cross  alignment,  and  identification  of  open  reading  frames 
was  performed  using  the  Microgenie  sequence  analysis  software  (In- 
telliGenetics,  Inc.).  The  sequences  were  then  subjected  to  alignment 
searches  through  the  nonredundant  GenBank/EMBL  and  dbEST  da¬ 
tabases  using  GCG’s  WORDSEARCH  and  FastA  programs  from  the 
University  of  Wisconsin  and  the  BLAST  programs  from  the  National 
Center  for  Biotechnology  Information  (NCBI)  (Altschul  et  al,  1990) 


532 


ROMMENS  ET  AL. 


using  the  NCBI  e-mail  BLAST  servers.  Those  cDNAs  that  were  not 
identical  to  previously  characterized  cDNAs  of  proteins  of  known 
function  were  tested  for  the  presence  of  putative  exons  using  GRAIL 
(Gene  Recognition  Analysis  Internet  Link)  (UnerBacher  and  Mural, 
1991;  Shah  et  aL,  1994).  Potential  coding  regions  were  examined  for 
known  peptide  motifs  using  the  BLOCKS  database  (version  8.0)  e- 
mail  searcher  at  the  Fred  Hutchinson  Cancer  Research  Center  (Heni- 
koff  and  Henikoff,  1994).  Sequences  from  the  GT  clones  and  the 
corresponding  longer  cDNA  clones  were  submitted  to  the  dbEST  da¬ 
tabase  at  NCBI  under  the  GenBank  accession  numbers  listed  in 
Table  1. 


RESULTS 

Genomic  Clones 

The  region  surrounding  HSD17B  was  cloned  by 
screening  pools  of  the  CEPH  YAC  library  by  PCR  using 
oligonucleotides  corresponding  to  the  HSD17B1  cDNA 
(Luu-The  et  al,  1989).  Several  clones  were  found  to  be 
positive,  but  they  appeared  unstable,  with  significant 
rearrangement  and  internal  deletions  occurring  even 
within  single  growth  cycles.  Two  YAC  clones  were  con¬ 
firmed  by  hybridization  with  cDNAs,  279F8  (HSDl, 
210  kb)  and  368C7  (HSD2,  700  kb),  and  were  used  to 
retrieve  cDNAs  and  to  build  the  physical  map.  It  was 
determined  that  HSD2  was  chimeric  and  that  a  large 
portion  of  the  end  containing  the  right  YAC  arm  was 
found  to  originate  from  chromosome  16  (data  not 
shown).  An  additional  YAC  clone,  known  to  contain  the 
HSD17B  locus,  26F3  (HSD3)  of  290  kb  was  also  used 
for  analysis  and  mapping.  The  overlap  and  alignment 
of  these  YACs  are  shown  in  Fig.  1. 

Retrieval  of  cDNA  Fragments  and  Characterization 

by  Physical  Mapping 

A  pool  of  primary  cDNAs  prepared  from  poly(A) 
RNA  of  human  mammary  gland,  ovary,  and  placenta 
as  well  as  from  total  RNA  of  the  Caco-2  cell  line  was 
hybridized  to  purified  and  immobilized  YAC  clones 
(Rommens  et  al.,  1993,  1994). 

Initial  analysis  of  the  cDNA  clones  involved  a  pre¬ 
screen  for  ribosomal  sequences.  Approximately  10%  of 
the  clones  were  eliminated  as  they  hybridized  strongly 
with  radiolabeled  cDNA  obtained  from  total  RNA, 
Fewer  than  3%  of  the  remaining  clones  were  found 
to  be  derived  from  the  HSD17B  locus  as  identified  by 
hybridization  to  cDNA  fragments  of  HSD17B1  (Luu- 
The  et  aL,  1989).  One  positive  clone,  GT207,  was  ana¬ 
lyzed  by  sequencing  for  further  verification. 

The  retrieved  cDNA  fragments  that  were  not  elimh 
nated  in  the  prescreenings  were  then  verified  to  ori^- 
nate  from  the  starting  YAC  by  hybridization  to  restric¬ 
tion  digests  of  DNA  of  the  yeast  clones;  six  examples 
are  shown  in  Fig.  2A.  The  clones  were  tentatively  as¬ 
signed  into  groups  based  on  the  overlapping  and  non¬ 
overlapping  intervals  of  the  YAC  clones.  The  position 
of  each  clone  was  subsequently  verified  by  long-range 


physical  mapping  both  with  the  artificial  chromosomes 
and  with  genomic  DNA  (data  not  shown). 

That  each  of  the  cDNAs  also  originated  from  chromo¬ 
some  17  was  confirmed  by  hybridization  to  a  mouse 
hybrid  cell  line  that  contains  chromosome  17  as  its  only 
human  material;  six  examples  are  shown  in  Fig.  2B. 
With  one  exception,  the  EcoBl  restriction  fragments 
detected  in  the  yeast  DNA  corresponded  to  those  de¬ 
tected  in  the  chromosome  17  hybrid  and  in  human 
DNA.  The  cDNA  clone,  GT209,  displayed  an  interest¬ 
ing  pattern  that  is  shown  in  Fig.  2.  Subsequent  investi¬ 
gation  indicated  that  this  clone  mapped  to  the  left  arm 
boundary  and  was  present  only  partially  in  HSDl.  Fur¬ 
ther,  the  smallest  EcoRl  band  that  was  detected  in 
HSD2  and  HSD3  is  polymorphic  (data  not  shown).  Two 
groups  of  clones,  (1)  GT208  (shown  in  Fig.  2B),  GT214, 
GT217,  and  GT252  and  (2)  GT218,  GT298,  GT304,  and 
GT305,  and  the  singlet  clone  GT247  showed  strong  hy¬ 
bridization  to  chromosome  17  but  also  yielded  addi¬ 
tional  bands  with  comparable  or  weaker  intensity  in 
total  human  DNA. 

RNA  Hybridization 

Retrieved  cDNA  clones  were  hybridized  to  total  RNA 
of  a  series  of  different  tissues  and  cell  lines;  exampks 
are  shown  in  Fig.  3.  The  results  from  the  RNA  hybrid¬ 
ization  together  with  the  initial  physical  mapping  per¬ 
mitted  groups  of  cDNAs  to  be  assigned  to  individual 
transcription  units.  For  example,  GT208  and  GT217 
did  not  overlap  in  sequence  but  originated  from  a  com¬ 
mon  interval  of  the  HSD2  YAC  that  did  not  include 
portions  of  HSDl  or  HSD3  as  shown  in  Fig.  1.  Both 
hybridized  to  a  common  mRNA  species  of  3.2  kb  in 
several  tissues,  as  shown  in  Fig.  3.  Subsequently,  a 
longer  cDNA  clone  identified  from  a  mammary  gland 
library  confirmed  the  unit  overlap  as  it  contained  both 
of  the  smaller  cDNA  fragments. 

Comparable  analyses  of  each  of  the  cDNA  fragments 
yielded  evidence  for  at  least  10  transcription  units 
within  320  kb  of  genomic  DNA.  The  groups  of  clones 
and  their  relative  map  positions  of  the  individual  units 
have  been  summarized  in  Fig.  1  and  are  shown  num¬ 
bered  1  through  10.  The  precise  order  for  transcripts  4 
and  5  or  for  7  and  8  could  not  be  determined  from  the 
long-range  mapping  information  obtained.  Transcrip¬ 
tion  unit  9  corresponded  to  HSD17B1  (Luu-The  et  al., 
1989).  RNA  hybridization  for  transcription  units  1,  2, 
3,  6,  7,  8,  and  10  is  shown  in  Fig.  3.  mRNAs  correspond¬ 
ing  to  GT219,  GT227,  GT232,  and  GT235  were  not  ob¬ 
served  by  RNA  hybridization  analysis,  so  no  corre¬ 
sponding  transcription  units  were  assigned;  Three  of 
these  four  clones  map  to  the  same  physical  interval  as 
shown  in  Fig.  1, 

One  group  of  clones,  including  GT218  and  GT305, 
hybridized  to  at  least  two  discrete  messages  of  4.0  and 
1,9  kb.  Further,  the  clone  GT305  also  detected  a  mRNA 
of  1.0  kb  specifically  in  the  brain;  see  Fig.  3.  These 
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FIG.  1.  Physical  and  transcript  map  at  the  HSD17B  locus.  The  overlapping  YAC  clones  containing  the  HSD17B  locus  and  the  detailed 
physical  map  that  was  generated  are  shown  aligned  to  a  long-range  physical  map  of  human  DNA.  The  positions  of  the  restriction  sites  for 
Mlul  Notl,  andNrul  were  determined  by  single  and  double  digestion  combinations.  Because  of  absence  of  methylation,  additional  restriction 
sites  were  noted  in  the  YACs.  The  map  interval  of  each  cDNA  clone  is  indicated  by  the  horizontal  lines  shown  dravm  above  lasted 
clones.  These  intervals  were  determined  by  overlap  of  the  three  YACs  or  by  the  analysis  of  additional  subclones  of  the  HSp2  YAC  that 
contained  deletions;  only  the  longest  subclone  of  HSD2  is  shown.  Map  position  and  RNA  hybridization  results  were  used  to  obtain  the 
transcript  map  shown  below  the  groups  of  clones.  Pound  signs  and  asterisks  indicate  that  transcript  order  within  the  single  physica  intervals 
could  not  be  determined.  The  clones  GT219,  GT227,  GT232,  and  GT235  shown  enclosed  by  brackets  were  not  assigned  to  transcription  units. 


mRNAs  were  grouped  as  transcription  unit  10.  Al¬ 
though  these  clones  hybridized  very  strongly  to  single 
EcoRl  and  Notl  restriction  fragments  in  YAC  genomic 
DNA,  a  more  complex  hybridization  pattern  was  ob¬ 
served  with  human  DNA.  Hybridization  to  a  hybrid 
mapping  panel  indicated  that  at  least  one  copy  of  these 
selected  sequences  did  originate  from  chromosome  17 
but  that  at  least  four  additional  copies  were  present 
elsewhere  in  the  genome,  specifically  on  chromosomes 
1,  6,  11,  and  12  (data  not  shown). 

Two  selected  cDNAs,  GT211  and  GT198,  that  corre¬ 
spond  to  transcription  units  5  and  7  were  also  assessed 
by  hybridization  to  poly(A)*^  RNA,  as  shown  in  Fig.  4,  to 
obtain  a  more  complete  analysis  of  pattern  of  expression. 
They  are  described  in  subsequent  sections.  The  two 
mRNAs  of  5.0  and  6.5  kb  of  transcription  unit  5  (Fig.  4A) 
that  are  detected  with  GT211  may  be  due  to  alternative 
splicing  or  to  differential  polyadenylation  but  are  likely 


to  be  derived  from  a  single  transcriptional  unit.  The 
pattern  of  hybridization  to  DNA  with  GT211  was  com¬ 
plex  (see  Fig.  2B)  but  was  consistent  with  a  single  loca¬ 
tion  on  chromosome  17.  The  complexity  resulted  from 
the  strong  cross-hybridizing  bands  of  rodent  DNA  and 
because  at  least  five  exons  were  present  in  the  GT211 
cDNA  fragment.  Each  of  the  human  bands  was  present 
on  chromosome  17  and  on  both  HSDl  and  HSD2  YACs, 
but  was  absent  on  HSD3;  see  Fig.  2 A. 

Longer  cDNA  clones  were  obtained  for  selected  cDNA 
fragments  as  summarized  in  Table  1.  Presence  in  the 
cDNA  libraries  was  initially  established  by  amplifica¬ 
tion  with  specific  oligonucleotides  designed  from-the 
sequences  of  the  selected  fragments, 

cDNAs  of  Three  Known  Human  Genes 

The  sequences  of  selected  cDNAs  and  of  isolated  cor¬ 
responding  cDNA  clones  were  determined  and  ana- 
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FIG.  2.  Verification  of  origin  of  selected  cDNAs  to  genomic  DNA.  Six  cDNAs  are  shown  hybridized  to  (A)  0.5  fig  of  total  genomic  yeast 
DNA  of  YAC  clones  HSDl,  HSD2,  and  HSD3  and  (B)  10  fig  of  mammalian  DNA  that  had  been  digested  with  EcoRl  restriction  enzyme. 
The  DNAs  were  separated  on  0.7%  agarose  gels  and  transferred  to  nylon  membrane.  The  lanes  between  the  labeled  yeast  DNAs  are  control 
samples  that  correspond  to  YAC  clones  that  do  not  overlap  with  the  HSD17B  locus.  The  mammalian  DNA  was  prepared  from  human 
leukocyte,  mouse-human  hybrid  (GM10498A),  and  mouse  L  cell  line.  The  hybrid  mouse  cell  line  contains  chromosome  17  as  its  only  human 
material.  cDNAs  (e.g.,  GT252)  could  not  be  assigned  definitively  to  chromosome  17  when  hybridizing  bands  appeared  in  human  DNA  that 
did  not  appear  in  the  hybrid  ceil  line.  Cross-species  hybridization  is  noted  with  several  of  the  retrieved  cDNAs. 


lyzed  for  coding  potential  and  compared  to  sequences 
of  the  public  databases.  Three  of  the  10  transcription 
units  could  be  readily  identified;  they  corresponded  to 
the  genes  for  the  Ki  nuclear  autoantigen  (Nikaido  et 
aL,  1990;  Albertson  et  aL,  1994b),  y-tubulin  (Zheng  et 
al,  1991),  and  HSD17B1  (Luu-The  et  aL,  1989).  One 
hundred  percent  identity  of  sequence  was  observed  be¬ 
tween  the  retrieved  cDNA  fragments  and  the  human 
gene  sequences  available.  The  details  are  given  in  Ta¬ 
ble  1.  These  genes  are  shown  as  transcription  units  1, 
6,  and  9,  respectively,  in  Fig.  1. 

One  interesting  feature  was  noted  for  the  mRNA  that 
corresponded  to  the  y-tubulin  gene.  The  1.8-kb  tran¬ 
script,  detected  by  GT234  in  Fig.  3,  was  found  in  all  RNA 
samples  tested  but  appeared  as  a  doublet  band  in  the 
brain.  This  suggests  the  existence  of  a  tissue-specific  al¬ 
ternative  splicing  or  dijfferential  polyadenylation. 

Identification  of  Human  Homologues  of  Two  Rat 

Genes 

Tentative  identification  of  two  other  transcription 
units,  4  and  10,  could  be  achieved  by  alignment  of  trans¬ 


lated  cDNA  sequences  to  rat  genes.  The  nucleotide  se¬ 
quence  of  the  predicted  coding  region  of  GT247,  tran¬ 
scription  unit  4,  shares  90%  identity  with  that  of  the 
rat  ribosomal  gene,  L34.  Significant  similarity  was  also 
observed  between  the  117-amino-acid  putative  peptide 
encoded  and  the  117-amino-acid  peptide  o{L34  (Aoyama 
et  aL,  1989),  With  the  exception  of  the  13  consecutive 
residues  from  amino  acid  position  75  through  and  in¬ 
cluding  position  88,  due  to  a  deletion  of  a  T  at  codon  74 
and  an  insertion  of  a  T  at  codon  88,  only  5  of  the  re¬ 
maining  104  amino  acids  were  not  identical.  Further, 
as  shown  in  Fig.  5,  the  positions  of  the  initiation  and 
termination  codons  were  identical. 

The  sequence  corresponding  to  transcription  emit  10, 
which  was  compiled  from  overlapping  sequences  of 
GT218,  GT226,  GT298,  GT304,  and  GT305^and  several 
cDNA  clones  isolated  from  a  mammary  gland  cDNA  li¬ 
brary,  shared  72%  sequence  identity  with  the  coding 
region  of  a  rat  liver  tyrosine  phosphatase  cDNA  (Dia¬ 
mond  et  aL,  1994).  Translation  of  the  composite  se¬ 
quence  yielded  an  apparent  incomplete  open  reading 
frame  of  154  amino  acids.  Ninety  percent  were  identical 
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123456 


28S  — 
18S  — 


^3.2kB 


-^3.2kB 


GT208 

123456 

28S —  -^4.0kB 

183 —  ••*>*  *  <-1.9kB 


GT218 

123456 

mmmrnmm  <‘2.4k8 

GT338 
1  23  4  5  6 


28S- 

18S- 


GT217 
1  2  3  4  5  6 


_  — -^4.0kB 
,  ^  ^  W  fv*  -^1 .9kB 

-<-1.0kB 

GT305 

123456 

■■■  '''^4 

GT198 

123456 


corr6sponding  to  th©  first  42  and  th.6  last  50  amino 
acids  in  the  alignment  shown  in  Fig.  6,  57  and  70% 
identities  were  observed.  Also,  the  latter  region  of  iden¬ 
tity  included  a  putative  nuclear  localization  signal  as 
indicated,  followed  by  the  consistent  spacing  and  con¬ 
servation  of  5  of  5  cysteines  within  the  following  41 
amino  acids.  Portions  of  this  human  homologue  have 
also  been  reported  in  a  preliminary  map  generated  by 
others  (Friedman  et  aL,  1994b). 

E(z)  in  D.  melanogaster  is  involved  in  the  negative 
regulation  of  the  segment  identity  genes  of  the  Anten- 
napedia  (ANT-C)  and  Bithorax  (BX-C)  gene  complexes. 
This  gene  exhibits  two  mRNA  species,  2.4  and  3.3  kb, 
that  are  regulated  in  a  development-specific  manner. 
The  smaller  mRNA  is  expressed  during  embryo,  pupae, 
and  adult  stages,  while  the  larger  3.3-kb  mRNA  is  not 
expressed  in  the  embryo,  but  is  expressed  in  the  pupae 
and  adult  stages  with  higher  abundance  in  males 
(Jones  and  Gelbart,  1993).  These  transcripts  are  nota¬ 
bly  smaller  than  the  two  mRNAs  of  5  and  6.5  kb  de¬ 
tected  with  GT211  in  many  adult  tissues  (Fig.  4A). 
There  appears  to  be  some  tissue-specific  regulation  of 
the  individual  human  messages;  for  example,  skeletal 
muscle  displayed  the  smaller  mRNA  most  prominently, 
while  nearly  equal  levels  were  detected  in  spleen  and 
thymus.  Expression  of  this  gene  was  also  observed  in 
fetal  tissues  including  brain,  lung,  liver,  and  kidney 
(data  not  shown). 


28S—  -^4.5kB 

.i.85kB  . 

1 8S  —  .8kB  '  ■  ■ 


GT234  GT236 

FIG.  3.  Hybridization  of  cDNAs  to  total  RNA.  Nine  retrieved 
cDNAs  are  shown  hybridized  to  10  /ig  of  total  RNA  of  (1)  Caco- 
2  (intestinal  cell  line),  (2)  BT-474  (breast  carcinoma  cell  line),  (3) 
fibroblast,  (4)  HL60,  (5)  lymphoblast,  and  (6)  frontal  cortex  of  adult 
brain  as  indicated.  They  demonstrate  7  of  the  10  transcription  units 
shown  in  Fig.  1.  The  integrity  of  the  RNA  is  shown  in  the  gel  stained 
with  ethidium  bromide  (upper  left).  The  positions  of  the  ribosomal 
RNAs,  18S  and  28S,  are  indicated  to  the  left.  Sizes  of  the  transcripts 
in  kilobases  (kb)  are  given  to  the  right  of  each  panel. 

to  the  corresponding  portion  of  the  173  amino  acids  of 
the  nuclear  tyrosine  phosphatase  of  the  immediate-early 
PRL-1  gene  of  rat  (Diamond  et  al,  1994).  These  results 
indicate  that  the  cDNAs  corresponding  to  transcription 
units  4  and  10  encode  the  human  homologues  for  the 
■’at  ribosomal  L34  and  PRL-1  genes,  respectively. 

Identification  of  a  Putative  Transcription  Factor 

The  sequence  of  818  nucleotides  of  clone  GT211, 
which  identified  transcription  unit  5,  did  not  align  to 
human  or  mammalian  sequences.  Alignment  of  the 
translated  amino  acid  sequence  did  reveal  segments  of 
striking  amino  acid  similarity  with  translated  portions 
of  the  Enhancer  ofzeste  gene,  E(z),  of  Drosophila  mela- 
logaster  (Jones  and  Gelbart,  1993).  In  two  stretches 


Novel  Genes  and  Identities  with  Expressed  Sequence 

Tags 

Identification  of  the  roles  of  the  genes  corresponding 
to  the  four  remaining  transcription  units  was  limited,  as 
distinctive  peptide  motifs  were  not  recognized.  Transcrip¬ 
tion  unit  3,  detected  by  GT196  and  GT233,  corresponded 
to  a  small  mRNA  of  1.0  kb.  Neither  the  nucleotide  nor 
the  translated  amino  acid  sequences  aligned  in  the  pubhc 
databases,  despite  the  notably  abundant  expression  lev¬ 
els  observed  in  human  tissues  including  adult  brain  (Fig. 
3).  Further  characterization  was  confounded  by  observing 
that  all  nine  cDNA  clones  isolated  from  the  mammaiy 
gland  cDNA  library  were  chimeric. 

Segments  of  the  retrieved  cDNAs  of  transcription 
units  2,  7,  and  8  aligned  with  near  identity  to  expressed 
sequence  tags  determined  from  cDNA  libraries  of  the 
HL60  cell  line  or  of  a  72-day-old  human  female  brain, 
as  detailed  in  Table  1.  Discrete  mRNAs  of  2.4  and  4.5 
kb  were  detected  for  transcription  units  2  and  8,  respec¬ 
tively.  Both  appear  to  be  relatively  abundant  in  all 
tissues  examined;  see  the  hybridization  results  with 
GT338  and  GT236  in  Fig.  3.  The  full-length  cDNA  cor¬ 
responding  to  transcription  unit  7  is  discussed  below. 

Isolation  of  Full-Length  cDNA  Clones  for 

Transcription  Unit  7 

GT198  of  transcription  unit  7  revealed  a  message  of  1.6 
kb  with  dramatic  differences  in  expression  levels  between 
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FIG.  4.  Hybridization  of  cDNAs  to  poly(A)^  RNA.  GT211  (A)  and  GT198  (B)  are  shown  hybridized  to  2  of  poly(A)^  RNAs  of  the  tissues 
indicated.  The  blots  were  obtained  from  Clonetech  Labs.  The  sizes  of  the  markers  listed  to  the  left  and  the  sizes  of  mRNAs,  as  indicated  to  the 
right,  are  given  in  kilobases  (kb).  In  B,  the  exposure  time  of  the  blots  of  the  first  series  of  tissues  including  spleen,  thymus,  prostate,  testis, 
ovary,  small  intestine,  colon,  and  leukocyte  was  0.7  days,  while  the  exposure  time  for  the  remaining  lanes  shown  was  8  days. 


tissues.  Marked  abundance  was  observed  in  three  sam¬ 
ples  including  the  BT-474  cell  line  (Fig.  3),  and  testis  and 
colon  tissues  (Fig.  4B).  This  contrasted  with  the  barely 
detectable  level  observed  in  a  series  of  cell  lines  including 
one  derived  from  a  colon  carcinoma  (Caco-2)  and  several 
other  tissues  including  normal  mammary  gland.  As  dem¬ 
onstrated  in  the  central  and  right  portions  of  Fig.  4B,  low- 
level  expression  was  observed  upon  prolonged  expostore  of 
the  blots  in  all  tissues  tested.  Low-level  expression  was 
also  observed  in  fetal  samples  of  liver,  kidney,  lung,  and 
brain  (data  not  shown). 

Given  the  interesting  expression  pattern,  the  full- 
length  cDNA  for  GT198  was  obtained  by  repeated 
screenings  of  human  breast  cancer  cell  (ZR-75-1)  cDNA 
library  (Fig.  7).  The  length  of  this  cDNA  was  consistent 
with  the  size  of  the  mRNA  observed.  From  the  nucleo¬ 
tide  sequence  shown  in  Fig.  7B,  the  longest  putative 
open  reading  frame  included  203  amino  acids.  This  open 
reading  frame  orientation  was  consistent  across  splicing 
boundaries  of  the  introns  found  in  the  initial  GT198 
cDNA  fragment  and  is  preceded  by  a  5'  untranslated 
region  of  133  nucleotides.  Further,  a  polyadenylation 
consensus  signal  was  located  17  nucleotides  upstream 
from  the  poly(A)  tail.  The  expressed  sequence  tags  that 
could  be  aligned  (see  Table  1)  corresponded  to  287  nucle¬ 
otides  of  the  3  '-untranslated  region  (EST  Z38575)  and 
to  the  99  nucleotides  of  codons  73  through  106  (EST 
42353)  within  the  open  reading  frame.  It  was,  however, 
not  possible  to  gain  insight  into  the  function  of  this  gene 
as  no  known  protein  motifs  were  recognized. 

DISCUSSION 

All  of  the  transcripts  have  not  been  explored  in  de¬ 
tail,  and  transcription  units  were  defined  only  when 


evidence  for  RNA  hybridization  was  detected.  Four  of 
the  selected  cDNA  fragments  did  not  detect  discrete 
mRNAs  in  total  RNA  and  thus  were  not  aligned  into 
transcription  units.  Based  on  analysis  by  GRAIL,  they 
did  not  reveal  strong  potential  for  coding  sequence,  as 
complete  open  reading  frames  could  not  be  detected  in 
these  clones.  However,  evidence  that  they  corres¬ 
ponded  to  additional  genes  was  observed,  as  three  of 
these  four  fragments  displayed  cross-species  hybridiza¬ 
tion.  Further,  as  indicated  in  Table  1,  GT232  aligned 
to  an  expressed  sequence  tag.  The  sensitivity  of  direct 
selection  is  very  high,  and  it  is  possible  that  these  frag¬ 
ments  correspond  to  genes  that  are  restricted  in  their 
expression  pattern  or  are  expressed  at  very  low  level, 
below  the  detection  limit  of  RNA  hybridization.  As  the 
clones  GT219,  GT227,  and  GT235  mapped  to  the  same 
interval,  it  is  possible  that  they  are  distinct  portions  of 
the  same  rare  transcript. 

Only  four  tissues  were  used  as  the  cDNA  sources  of 
the  selection  experiments,  so  it  cannot  be  determined 
whether  all  transcribed  sequences  from  the  region  ana¬ 
lyzed  have  been  identified.  However,  the  region  exam¬ 
ined  was  found  to  be  relatively  rich  in  genes.  These 
included  genes  that  are  expressed  at  relatively  low  lev¬ 
els  as  well  as  those  that  appear  broadly  and  abundantly 
expressed  in  several  tissues.  A  summation  of  the  sizes 
of  all  of  the  transcripts  detected  suggests  that  up  to 
25-30  kb  of  the  genomic  DNA  from  the  320-kb  region 
analyzed  is  present  in  mature  transcripts.  It  was  nota¬ 
ble  that  long  range  mapping  with  rare  cutting  enzymes 
did  not  predict  the  high  density  of  genes  found. 

While  initial  findings  based  on  the  tissue-specific  ex¬ 
pression  pattern  indicated  that  GT198  may  have  been 
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TABLE  1 

Summary  of  cDNA  Clones 


Isolated  breast  cDNA  subclones 
(HLl061b  cDNA  library) 

Retrieved  GT  clones 

Size  of 

Transcription  unit 

Name 

Size 

(bp) 

cDNA 

source 

Isolated 

cDNA 

Clones 

Size  of  isolated 
clones  (kb) 

longest 

contig 

(kb) 

GenBank 

accession 

number 

Number 

Transcript 
size  (kb) 

Homologies 

GT208 

GT213 

GT214 

GT217 

GT252 

554 

554 

579 

272 

Caco-2 

Caco-2 

Caco-2 

Placenta 

Caco-2 

4 

2 

1.0,  1.7,  2.0  and  3.8 

0.7  and  3.8 

0.6 

0.6 

— 

1 

3.2 

Both  contigs  identical  to  human  Ki  antigen 
cDNA  (GenBank  Accession  No.  U11292) 
GT208  contig  =  U11292  bases  735-1289 

GT217  contig  =  U11292  bases  1860-2442 

EST  M85569  =  GT217  contig  bases  560-355 
GT217  homolgous  to  part  of  mouse  EST 
#L26609 

GT197 

GT199 

T243 

r338 

628 

860 

627 

657 

Caco-2 

Caco-2 

Caco-2 

Placenta 

3 

3 

0.8,  1.5,  and  2.9 

0.85,  1.6,  and  2.8 

1.5 

L38932 

2 

2.4 

EST  D20904  =  GT197  contig  bases  1377-1530 

GT196 

GT233 

486 

625 

Caco-2 

Caco-2 

9 

0.7,  0.750,  1.0,  and  1.8 

0.6 

L38939 

3 

1.0 

No  homology 

GT247 

392 

Ovary 

- 

— 

— 

L38941 

4 

0.3 

Human  homolog  of  rat  ribosomal  protein  L34 
cDNA  (GenBank  Accession  No.  X14401) 

GT211 

GT300 

819 

607 

Caco-2 

Ovary 

5 

1.0,  1.6,  1.9,  2.2,  and 

3.4 

0.8 

L38934 

5 

5.0  and  6.5 

Similarity  with  part  of  D.  melanogaster 

Enhancer  of  zeste  (E(z))  cDNA  (GenBank 
Accession  No.  U00180) 

GT209 

GT234 

631 

844 

Caco-2 

Caco-2 

6 

1.8 

The  two  original  clones  identical  to  human  y- 
tubulin  cDNA  (GenBank  Accession  No. 
M61764) 

GT209  bases  8-607  =  M61764  bases  33- 
631 

GT234  =  M61764  bases  716-1559 

GT198 

418 

Caco-2 

2° 

0.9  and  1.5 

1.5 

L38933 

7 

1.6 

EST  Z42353  bases  103-201  =  GT198  contig 
bases  353-451 

EST  Z38575  -  GT198  contig  bases  1439-1153 

GT212 

GT236 

GT339 

374 

990 

374 

Caco-2 

Caco-2 

Caco-2 

4 

0.8,  1.2,  2.1,  and  4.2 

1.2 

L38935 

8 

4.5 

EST  T17366  =  GT212  contig  bases  1-322 

GT207 

1216 

Caco-2 

— 

— 

— 

— 

9 

2.3  and  1.3 

Identical  to  human  17/3-HSD  type  I  cDNA 
(GenBank  Accession  No.  X13440  bases 
35-1250) 

GT218 

GT226 

GT298 

GT304 

GT305 

615 

-300 

-750 

-700 

-418. 

Caco-2 

Caco-2 

Caco-2 

Caco-2 

Mammary 

gland 

1 

6 

2.9 

1.2,  1.4,  1.6,  2.1,  2.3, 
and  2.4 

1.1 

L39000 

10 

1.9  and  4.0 

GT218  contig  is  the  human  homolog  of  a  rat 
tyrosine  phosphatase  cDNA  (GenBank 
Accession  No.  L27843) 

EST  Z30155  =  GT218  contig  bases  583-882 

EST  T10429  =  GT218  contig  bases  399-609 
GT305  contig  bases  1-136  =  Z15428  bases 
200-336 

GT219 

535 

Caco-2 

__ 

— 

— 

L38936 

— 

n.d. 

No  homology 

GT227 

303 

Ovary 

1 

1.3 

0.3 

L38937 

— 

n.d. 

No  homology 

GT232 

423 

Ovary 

1 

2.9 

0.5 

L38938 

— 

n.d. 

EST  Z40888  =  GT232  contig  bases  472- 180 

GT235 

537 

Caco-2 

10 

0.8,  1.2,  1.3,  1.4,  2.0, 
and  2.3 

0.7 

L38940 

— 

n.d. 

No  homology 

Note  A  summary  of  the  characterization  of  the  retrieved  clones  and  the  corresponding  isolated  cDNAs  from  cDNA  libraries  are  listed. 
The  clones  are  grouped  corresponding  to  the  transcription  units  shown  in  Fig.  1.  For  the  selected  clones,  the  tissue  source  could  be  identified 
by  the  sequence  tag  that  had  been  incorporated  during  the  synthesis  of  cDNA.  The  longest  length  of  the  cDNA  “contig"’  given  m  the  sixth 
column  was  obtained  by  compiling  overlapping  sequences  of  selected  clones  and  longer  cDNAs.  The  entire  contig  sequence  was  assigned 

the  GenBank  accession  numbers  that  are  indicated,  n.d.  denotes  none  detected.  ,  t  l  TTT-!Acrvu\ 

"  The  cDNA  library  screened  for  GT198  was  made  from  the  ZR-75>1  human  breast  carcinoma  cell  line  (Clonetech  Labs,  HL1059b). 

an  interesting  candidate  gene  for  involverriGnt  in  breast  Simard  Qt  al.,  1994,  Friedman  et  al.,  1994a)  ai^e 

cancer,  the  recent  identification  of  disease-associated  against  this  being  BRCAl.  The  region  surrounding 

mutations  in  a  putative  transcription  factor  (Miki  et  GT198  is  not  amplified  in  the  BT-474  cell  line  (data 

al,  1994;  Futreal  et  al,  1994a;  Castilla  et  a/„  1994;  not  shown),  but  the  interesting  expression  pattern  ob- 
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GT247  cDNA 
rat  L34  cDWA 


GATG  *TCTGCAGGCACTCAGA 
-T--C-G-AG - G 


GT247  cDNA 
rat  L34  cDNA 


■?G  "c  cic  cc?  T?G  Icl  cll  erf  n“ 


CGT  TTG  ACA  TAG  CGA  CGT  AGG  CTT  TCC  TAG  AAT  AGA  GGG  TGT  A.AC 
. . --T  ---  ---  - -  ...  ... 


AAA  ACT  AGG  CTG  TCC  CGA  ACC  CCT  GGT  AAT  AGA  ATT 


GT247  cDNA 
rat  L34  cDNA 


GTT  TA^  S  St  IcC  Sg  Sg  G^T  GGG  gS  CgI  gS  ^G^  gS  gJg  Sc  CcS  gS  cS  G^^  COT  OT  OT^  AgS 

.  - ’■ . . -C  -  -TG - G-  -G . -  -C  GT-  G-  -G  — 

Val  Ala 


GT247  CDNA 
rat  L34  cDNA 


5  H  B  ??  ??  ""  ^  =CT  ^  TOT  GOT  COT  Sc  ^  210 

G  --  -  G  -C  --T  CAG  CA-  -G-  CTA  T-G  T-G  CT-  CAT  GTG  CGC  C--  GTG  TG-  -CG  _ 

Gin  Gin  Gly  Leu  Trp  Trp  Leu  His  Val  Arg  Gin  Val  Cys  Pro 


GT247  CDNA 
rat  L34  cDNA 


lie  Lys  Arg  Ala  Phe  Leu  lie  Glu  Glu  Gin  Lys  lie  lie  Val  Lys  Val 
AiC  AAG  CGT  GCT  TTC  CTT  ATC  GAG  GAG  CAG  AAA  ATC  ATT  GTG  AAA  GTG 


Val  Leu  _ys  Ala  Gin  Ala  Gin  Ser  Gin  Lys  Ala  Lys  End 

GTG  TTG  A.AG  GCA  CAA  GCA  CAG  AGT  CAG  AAA  GCA  AAA  TAA  ATC-G’ *G 


GT247  cDNA 
rat  L34  cDNA 


•AAACTTTTTTTAAJW^AAAAAAAAAA  38  6 

C-G - A-GT--T - TC-  -GCTCTCTCTAAA-.AA 


indicated  above,  are  shown  numbered  according  to  the  putative  ATG  initiation  codon  of  the  rat  L34 


served  suggests  that  additional  analysis  of  this  gene 
may  be  warranted  in  sporadic  breast  cancers. 

The  complete  characterization  of  the  putative  tran¬ 
scription  factor  corresponding  to  the  transcription  unit 
detected  with  GT211  will  also  be  of  interest.  While  a 
role  for  this  gene  cannot  yet  be  accurately  postulated 
in  human,  the  role  of  E(z)  in  Drosophila  is  to  provide 
a  negative  controlling  component  or  feature  of  tran¬ 
scription  of  the  Antennapedia  and  Bithorax  gene  com¬ 
plexes.  This  is  thought  to  involve  the  counteraction  of 
the  E(z)  gene  and  the  trithorax  gene  {trx)  products.  It 
is  interesting  to  note  that  the  human  homologue  of  the 
trx  gene,  which  maps  to  chromosome  llq23,  has  been 
shown  to  be  involved  in  translocations  in  acute  leuke¬ 
mias  (Djabali  et  al,  1992;  Tkachuk  et  aL,  1992).  We 
speculate  that  it  is  possible  that  abnormal  depletion  or 


inactivation  of  a  negative  component  of  transcription 
is  comparable  to  the  abnormal  activation  of  a  positive 
component  of  transcription  that  occurs  via  transloca¬ 
tion.  Consequently,  aberrations  of  activities  of  the  tran¬ 
scription  unit  detected  by  GT211  may  also  be  involved 
in  some  cancers. 

Two  sets  of  clones  appeared  to  have  features  that 
emphasized  the  limitations  of  direct  selection  proce¬ 
dures  for  the  identification  of  genes.  For  example,  the 
sequencing  analysis  of  GT198  indicated  that  this  se¬ 
lected  clone  likely  contained  at  least  a  portion  of  one 
intron  and  another  intact  intron.  This  was  determined 
by  alignment  of  sequence  to  longer  cDNAs  that  were 
subsequently  isolated  from  a  ZR-75-1  human  breast 
cancer  cell  cDNA  library.  The  alignment  of  identical 
sequence  ended  abruptly  within  the  cDNA,  and  se- 


^  Thr  Gly  Ala  Glu  Glu  Ser  Leu.  Phe  Arg  Val  Phe  His  Gly  Thr  Tyr 

TGG  ACT  GGG  GCT  GAA  GAA  TCT  CTT  TTT  CGA  GTC  TTC  CAT  GGC  ACC  TAC 

A  C  --C  C  G-A  -  -A-  --G  --T  --G  -  AAA  GT-  --T 

Asp  Gin  Ala  Tyr  Leu  Lys  Val 

Ala  Val  Lys  Glu  Ser  Leu  He  Leu 
ACG  TGC  AAG  CAG  GTC  TTT  CAG  TTT  GCA  GTC  AAA  GAA  TCA  CTT  ATC  CTG 

CGT  --A  --G  -AC  G . -C  CAG  --G  —  GAC  GC-  GAG  T-T 

Gin  Asp  Ala  Glu  Phe 

“is  Cys  Arg  Lys  He  Gin 

AAG  AAA  AGA  AAG  CAC  AGA  TTG  TGG  GCT  GCA  CAC  TGC  AGG  AAG  ATT  CAG 

-  G  -AG --A  C-T  C-- T-G  CTG — T  C-T 

Lys  Gin  Ser  Leu 


rtsn  Asn  ?ne  Cys  Ser  He  Ala  Arg  His  Met  Gly  Thr  Lys 
TTC  AAC  AAC  TTC  TGT  TCA  ATA  GCC  AGG  CAT  ATG  GGG  ACC  aIg  '  593 

-A  --G  -A-  --C  G-G  — T  --A  CAC  A-C - CT-  ---  1531 

Leu  Lys  Tyr  Als  His  Asr.  Leu 

***  Leu  Met  Asn  Pro  Ser  Gin  Lys 

^C  T  c  rfr  ™  ***  680 

Ser  Ph^  "GC  -—  1621 

Ser  Phe  Glu  .nsp  Leu  Arg  Gin  Asp  Phe  Thr  ?ro  Arg 

Leu  Lys  Lys  Asp  Asn  Ser  Ser  Thr  Gin  Val  Tyr  Asn  Tyr  Gin 

CTG  AAG  AAA  GAT  AAC  TCT  TCC  ACA  C.AA  GTG  TAC  AAC  TAC  C.AA  770 

-T - G  -C  TCA  -C  -G  -AT  -C - ACG  1711 

Ser  Asn  His 


GT711  rrr  Srr  Pro  Cys  He 

GT211  CCC  aGC  GAC  CAC  CCA  GAC  CGC  CCC  TGT  GAC  AGC  ACC  TGC  CCC  TGC  ATC  817  -  ^ 

E(z)  - T  -C  -GA  -AT - C - TG  -A - T - T  1759 

Gly  His  Met  Asn  Ser 

FIG.  6.  ^i^ment  of  GT211  with  Enhancer  of  zeste.  Alignment  of  the  nucleotide  and  predicted  amino  acid  spouptipp  rT9i  i  o 
portion  of  the  E(z)  gene  of  D.  melanogaster  is  shown.  Nucleotides  are  numbered  to  the  riffht  of  poch  lino  f  j*  u 

GenBank  entries  of  GT211  (L38934)  and  of  E(z)  (UOOISO).  The  sequence  of  GT211  cDNA  is^given*  identical 
are  indicated  by  dashes  or  the  corresponding  nucleotides,  respectively.  Only  the  different  amino  acid  residues  ofEfz)  arp  indVat 

sSafrunXiinl^  acids.  A  puta'tive  nutart^^^^^^^ 
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3  ATCAAGGTG^TCCC  -120 

i”  s  “  «  “  s:  “  s  fi  ss  S'  ™  ™  “  a:  “  “  “  '.s  “  s  =s  z  « 

sEssssisss-ssstssiSssjssstJsssssxssisisssssss;  » 

s z z z z z s S! s s: s s a: ™ ss a: s: s  a; 
z  If.  s  at  at  St  a:  z  ;s  s  s  ™  st  ts  ;s  z  s  st  “  at  a;  at  z  ts  a:  a:  s  at  at  ta  ». 

taststsstssttssatattaatattastatatstsatssKtsttataatatst  ;t: 

s;ss;as:taa:“““;s“a:a;s'.sst“ststaesattaa;a;asa:a;“  ts 

s  at  a;  at  at  ts  at  a:  st  at  st  a:  a;  a;  a:  at  at  at  at  s  st  at  a:  at  —  - 


ACC  ACA 
Thr  Thr 

AAT  CAT 
Asn  His 

AGA  GCT 
Arg  Ala 


FIG.  7.  Wl.l.,«.h  cDNA  fo,  GTO8.  (A,  S.h.n...«  of .».  •X",': 

:r„"  »“■  poiy.o»y.«»  f®*' « •»• 

underlined  with  a  solid  line. 


quences  consistent  with  two  3'  acceptor  sites  and  a 
single  5^  donor  site  were  found  in  the  selected  fragment 
with  appropriate  orientation.  Furthermore,  the  se- 
quence  of  the  longer  cDNA  was  consistent  with  the 
maintenance  of  open  reading  frame.  We  have  observed 
that  clones  containing  introns,  which-we  assume  must 
originate  from  a  rare  RNA  population,  may  be  favored 
for  selection  during  the  hybridization  steps.  Our  find¬ 
ings  have  been  supported  by  observing  that  unpro¬ 
cessed  cDNAs  are  more  commonly  retrieved  when 
cDNA  of  total  cell  RNA  is  used  compared  to  cDNA  of 
poly(A)*  RNA  (Rommens  et  al,  1994,  data  not  shown). 
It  is  unlikely  that  the  introns  originate  from  genomic 
DNA,  as  the  scheme  used  to  generate  cDNA  does  not 
involU  the  ligation  of  oligonucleotide  linkers  (Rom¬ 
mens  et  al.,  1993). 

A  second  limitation  of  direct  selection,  the  conse¬ 
quences  of  gene  families  or  pseudogenes,  was  observed 
by  analyzing  the  transcribed  unit  defined  by  clones 
GT218,  GT298,  GT304,  and  GT305.  Evidence  that 
these  clones  detect  a  gene  family  was  obtained,  so  it 
could  not  be  ascertained  which  of  two  (detected  by 


GT218)  or  three  (detected  by  GT305)  mRNAs  (Fig.  3) 
detected  actually  originated  from  chromosome  17.  The 
hybridization  pattern  to  genomic  DNA  for  this  group 
of  clones  was  complex  compared  to  the  YAC  DNA  or 
chromosome  17  hybrid  cell  line.  With  GT218,  three  dis¬ 
crete  EcoRI  fragments,  suggesting  three  copies  or  three 
exons,  were  found  to  hybridize  from  chromosome  1,  but 
only  single  fragments  hybridized  from  other  chromo¬ 
somes,  including  6, 11, 12,  and  17  (data  not  shown).  It  is 
possible  that  the  sequences  on  chromosome  17  actually 
correspond  to  a  pseudogene  or  represent  a  single  mem¬ 
ber  of  a  highly  conserved  family,  but  the  issue  cannot 
be  resolved  from  our  current  data.  This  would  require 
complete  sequencing  of  the  hybridizing  genomic  re- 

We  have  compsired  the  transcription  map  generated 
from  the  region  surrounding  the  HSD17B  locus  to  re¬ 
cently  described  physical  (Couch  et  al.,  1995)  and  tran¬ 
scription  maps.  These  studies  used  exon  trapping,  con¬ 
served  sequence  screening,  CpG  island  rescue  (Brody 
et  al,  1995),  and  direct  library  screening  with  YAC  or 
cosmld  DNA  (Brody  et  al,  1995;  Friedman  et  al,  1995), 
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in  addition  to  direct  cDNA  selection  methods  (Brody  et 
al,  1995;  Osbome-Lawrence  etal,  1995),  as  strategies 
to  identify  transcribed  sequences.  Transcription  units 
1  and  9  (Albertsen  et  al,,  1994b),  which  had  been  de¬ 
scribed  earlier,  and  units  5  and  6  were  common  to  all 
maps  and  could  be  readily  identified  by  alignment  of 
sequences. 

Transcription  units  2,  3,  7,  and  8  could  also  be  as¬ 
signed  to  transcripts  of  comparable  sizes  in  at  least  two 
studies  by  partial  overlap  with  expressed  sequences 
that  have  been  recently  entered  into  public  databases 
(Brody  et  ah,  1995;  Friedman  et  ah,  1995).  Overall 
there  was  agreement  on  the  relative  position  of  each  of 
these  transcription  units,  and  supporting  information 
that  transcription  units  7  and  8  are  in  the  correct  order, 
as  shown  m  Fig.  1,  was  evident  (Couch  et  ah,  1995* 
Brody  et  ah,  1995). 

Of  the  expressed  sequences  that  were  obtained  that 
were  not  assigned  into  transcription  units,  only  GT235 
(Accession  No.  L38940)  was  found  to  overlap  with  an 
exon-trapped  clone  (Accession  No.  T27158;  Brody  ah, 
1995)  of  one  study.  Unassigned  cDNA  fragments  were 
isolated  in  all  of  the  studies,  and  it  is  possible  that  some 
correspond  to  other  common  transcripts  that  could  not 
be  recognized,  as  different  nonoverlapping  fragments 
have  been  isolated.  The  most  noteworthy  difference  be¬ 
tween  the  transcription  map  that  we  have  generated 
and  those  of  others  is  the  absence  of  the  pseudogene 
for  the  high-mobility  group  protein,  HMG17  (Brody  et 
ah,  1995;  Friedman  et  ah,  1995;  Osbome-Lawrence  et 
ah,  1995).  We  also  do  not  have  evidence  for  either  the 
T-eta  transcript  (5.0-kb  mRNA,  Brody  et  ah,  1995)  or 
the  6.5-kb  transcript  (Friedman  et  ah,  1995)  that  were 
obtained  but  that  would  map  between  units  5  and  6, 
or  for  a  2.5-kb  transcript  (Friedman  et  ah,  1995)  that 
would  be  positioned  near  transcription  unit  7.  Alterna¬ 
tively,  neither  sequence  overlap  nor  mRNA  size  permit¬ 
ted  recognition  of  either  transcription  units  4  or  10  in 
the  alternate  maps,  although  it  is  possible  that  portions 
of  unit  10  were  obtained  based  on  physical  position  of 
unassigned  exon-trapped  clones  (Brody  et  ah,  1995).  As 
the  different  transcripts  have  not  been  examined  in 
sufficient  detail,  it  is  not  possible  to  know  whether  dif¬ 
ferences  in  these  maps  reflect  incomplete  analysis  of  , 
retrieved  clones  or  whether  the  tissues  used  for  the 
different  strategies  and  subsequent  cDNA  screenings  < 
were  not  adequate  to  retrieve  all  genes. 
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SUMMARY 


We  have  analysed  20  breast/ovarian  cancer  families,  the 
majority  of  which  show  positive  evidence  of  linkage  to  chromosome 
17ql2,  for  germline  mutations  in  the  BRCAl  gene.  BRCAl  mutations 
cosegregating  with  breast  and  ovarian  cancer  susceptibility  were 
identified  in  16  families,  including  one  family  with  a  case  of 
male  breast  cancer.  Nine  of  these  mutations  have  not  been 
reported  previously.  The  majority  of  mutations  were  found  to 
generate  a  premature  stop  codon  leading  to  the  formation  of  a 
truncated  BRCAl  protein  of  2%  to  88%  of  the  expected  normal 
length.  Two  mutations  altered  the  RING  finger  domain. 
Sequencing  of  genomic  DNA  led  to  the  identification  of  a  mutation 
in  the  coding  region  of  BRCAl  in  twelve  families,  and  cDNA 
analysis  revealed  an  abnormal  or  missing  BRCAl  transcript  in  four 
of  the  eight  remaining  families.  A  total  of  eight  mutations 
were  associated  with  a  reduced  quantity  of  BRCAl  transcript.  We 
were  unable  to  detect  BRCAl  mutations  in  four  of  the  20  families, 
but  only  one  of  these  was  clearly  linked  to  BRCAl.  It  is 
expected  that  the  majority  of  clear  examples  of  the  breast- 
ovarian  cancer  syndrome  will  be  associated  with  germline 
mutations  in  the  coding  region  of  BRCAl. 


INTRODUCTION 

A  family  history  of  breast  or  ovarian  cancer  is  a  risk  factor 
for  the  development  of  cancer  of  either  type  (Lynch  et  al.  1981) . 
Segregation  analysis  suggested  the  existence  of  an  autosomal 
dominant  mutant  allele  (or  alleles) ,  accounting  for  from  5-10%  of 
breast  cancers  (Newman  et  al.  1988;  Claus  et  al.  1991) . _  Breast 
cancer  also  appears  in  families  with  ovarian  cancer  (Schildkraut 
et  al.  1989;  Go  et  al.  1983).  A  gene  responsible  for  hereditary 
breast  and  ovarian  cancer,  BRCAl,  was  mapped  by  linkage  to 
chromosome  17  in  1990  (Hall  et  al.  1990;  Narod  et  al.  1991)  and 
was  isolated  in  1994  (Miki  et  al.  1994).  This  large  gene 
contains  22  coding  exons  distributed  over  approximately  lOOkb  of 
genomic  DNA,  and  produces  a  protein  of  1863  amino  acids.  A  RING 
finger  motif  in  the  amino-terminal  portion  of  the  gene  suggests 
that  BRCAl  acts  as  a  transcription  factor  (Miki  et  al,  1994)  . 

In  a  recent  survey  of  63  cancer  families,  (Shattuck-Eidens  et 
al.  1995)  it  was  found  that  the  majority  (86%)  of  the  37 
different  BRCAl  mutations  were  either  frameshifts,  nonsense 
mutations  or  splice  site  mutations,  which  presumably  lead  to 
premature  truncation  of  the  BRCAl  protein.  Infrequent  regulatory 
mutations  associated  with  a  loss  of  transcript  corresponding  to 
the  mutant  BRCAl  allele  were  also  seen.  Missense  mutations  were 
reported  in  the  RING  finger  domain  and  elsewhere,  but  the 
functional  significance  of  most  missense  mutations  is  not  yet 
known . 

Although  many  breast-ovarian  cancer  families  have  been  found 
to  carry  mutations  in  the  coding  region  of  BRCAl,  mutations  have 
not  been  found  for  the  majority  of  cancer  families  studied  to 
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date  (Castilla  et  al.  1994;  Friedman  et  al.  1994;  Simard  et  al. 
1994;  Shattuck-Eidens  et  al.  1995;  Streuwing  et  al.  1995).  It  is 
not  yet  clear  if  the  absence  of  mutations  was  due  to  the 
imperfect  sensitivity  of  the  screening  test  used,  of  if  the 
mutations  were  outside  of  the  coding  region  of  the  gene,  or  if 
families  included  in  these  studies  were  due  to  susceptibility 
genes  other  than  BRCAl. 


Using  statistical  models,  Easton  et  al.  (1995)  estimated  that 
there  are  likely  to  be  two  types  of  BRCAl  families  --  those  with  a 
high  penetrance  of  ovarian  cancer  (84%  by  age  70  years)  and  those 
with  a  low  penetrance  of  ovarian  cancer  (32%  by  age  70  years) . 
No  particular  genotype-phenotype  correlations  have  yet  been 
revealed  which  support  this  theoretical  division.  We  have 
studied  a  panel  of  20  breast-ovarian  cancer  families,  each  with  a 
high  probability  of  linkage  to  BRCAl,  in  order  to  estimate  the 
sensitivity  of  direct  DNA  sequencing  and  SSCP  analysis  of  cDNA  as 
a  two-step  method  of  mutation  detection  and  to  identify  possible 
genotype /phenotype  correlations. 


METHODS 


Breast-ovarian  cancer  families  Each  of  the  20  families 
contained  a  ^minimum  of  four  cases  of  breast  or  ovarian  cancer, 
and  at  least  one  case  of  cancer  of  each  type.  These  families 
were  originally  chosen  for  linkage  studies,  and  therefore  blood 
samples  for  DNA  analysis  were  available  on  multiple  affected 
members  in  each  family.  Linkage  results  on  several  of  these 
families  have  been  reported  (Narod  et  al.  1991;  Feunteun  et  al. 
1993;  Simard  et  al.  1994;  Torchart  et  al.  1994;  Tonin  et  al. 
1994) .  All  families  but  one  are  followed  at  the  Department  of 
Preventive  Medicine  in  Creighton  University  School  of  Medicine  in 
Omaha;  family  4001  is  followed  at  the  Hopital  St.  Louis,  Paris. 

The  probability  of  linkage  to  BRCAl  for  each  of  these  families 
was  estimated  to  be  80%  or  greater.  These  probabilities  are 
based  on  the  prior  probabilities  of  linkage,  estimated  by  Narod 
et  al  (1995)  and  the  lod  scores  generated  by  BRCAl  markers.  For 
example,  it  is  estimated  that  81%  of  families  with  multiple  cases 
of  early-onset  breast  cancer  and  a  single  case  of  ovarian  cancer 
are  linked  to  BRCAl.  A  positive  lod  score  will  increase  the 
probability  of  linkage,  usually  to  90%  or  higher.  The 
characteristics  of  the  20  families  and  the  derived  probabilities 
of  linkage  are  presented  in  table  1.  The  average  age  of 
diagnosis  of  the  143  breast  cancers  in  these  families  was  41.8 
years,  and  of  the  54  ovarian  cancers  was  52.6  years.  One 
family  contained  a  case  of  male  breast  cancer. 
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DNA/RNA  extraction  DNA  was  extracted  from  Epstein-Barr  virus 
immortal i z ed  lymphoblastoid  cell  lines  using  standard  methods. 
RNA  isolation  was  performed  with  Trizol  reagent  (Gibco  BRL)  or 
with  the  QuickPrep  Micro  kit  (Pharmacia)  according  to  the 
instructions  provided  by  the  manufacturer. 


Linkage  analysis  To  establish  the  probability  that  a  family  was 
linked  to  BRCAl,  linkage  analysis  was  performed  using  five 
polymorphic  CA-repeat  markers  flanking  the  BRCAl  gene,  including 
the  intragenic  marker  D17S855.  The  other  markers  were  D17S250, 
D17S588,  D17S579  and  THRAl  (Easton  et  al.  1993).  Linkage  to  BRCA2 
was  assessed  using  the  chromosome  13  marker  AFM238zd9. 
Multipoint  linkage  was  performed  using  the  LINKAGE  program 
(Lathrop  and  Lalouel,  1984)  under  a  previously  described  model 
(Feunteun  et  al.  1993) .  Breast  and  ovarian  cancer  susceptibility 
was  modelled  as  a  dominant  trait  with  incomplete  penetrance. 
Affected  cases  were  assigned  to  liability  classes  which  reflect 
the  age-specific  penetrance  of  breast  and  ovarian  cancer.  The 
risks  of  sporadic  breast  cancer  (cancer  among  non-carriers)  were 
derived  from  the  cancer  rates  in  the  general  population. 


Detection  of  BRCAl  mutations  A  comprehensive  multistep  strategy 
for  BRCAl  mutation  identification  was  applied  to  the  20  families. 
The  first  step  was  the  direct  sequencing  of  the  entire  coding 
sequence  of  BRCAl  (with  the  exception  of  exon  4  which  was 
analysed  by  SSCP) .  Because  of  the  possibility  that  a  BRCAl 
mutation  may  exist  outside  of  the  coding  region  and  result  in  the 
production  of  a  mutant  RNA,  if  a  BRCAl  mutation  was  not  found  in 
the  genomic  DNA,  the  cDNA  was  analysed  for  mutations  by  SSCP  of 
RT-PCR  products.  When  abnormal  SSCP  bands  were  identified,  the 
corresponding  cDNA  fragment  was  analysed  by  direct  sequencing. 

In  addition,  it  is  also  possible  that  a  BRCAl  mutation  outside 
of  the  coding  region  may  result  in  the  loss  of  the  associated 
mRNA.  An  affected  individual  from  each  family  was  therefore 
studied  for  the  possibility  of  decreased  BRCAl  RNA  levels  in 
lymphocytes.  This  technique  is  based  on  the  identification  of 
constitutional  heterozygosity  at  one  of  a  number  of  intragenic 
BRCAl  polymorphisms.  RNA  loss  is  inferred  from  the  preferential 
presence  of  sequence  corresponding  to  the  normal  allele  observed 
in  the  cDNA  sequence  ladder  (figure  1) .  This  technique  is  used 
to  identify  the  class  of  inferred  regulatory  mutations,  which 
are  characterized  by  the  dominant  expression  of  the  normal  allele 
in  the  lymphocyte  DNA. 


o 

cDNA  synthesis  was  carried  out  at  37  for  one  hour  in  a  20  ul 
reaction  volume  containing  1  ug  of  total  cellular  RNA  or  30-50  ng 
of  polyA  RNA,  lx  buffer  (50  mM  tris  HCl  pH  8.3,  75  mM  KCl,  3  mM 
MgC12,  10  mM  DTT) ,  1  ug  random  primers  (Gibco  BRL)  500uM  dNTPs 
(Pharmacia)  lOu  RNAs  (Promega)  and  3u  MMLV  reverse  transcriptase 
(Pharmacia) . 
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47  primer  pairs  were  used  to  amplify  all  BRCAl  exons  from 
genomic  DNA,  and  33  primer  pairs  were  used  to  amplify  BRCAl  cDNA. 
PCR  was  performed  in  25  ul  solutions  containing  20-50  ng  of 
genomic  DNA  or  cDNA,  lx  PCR  buffer,  200  urn  dNTPs  (Pharmacia) ,  20 
pmoles  of  each  primer  (Genset) ,  1  uCi  33P  dCTP  (Amersham) ,  lu  Tag 
DNA  polymerase  (Eurobio) .  Reactions  were  performed  in  a  thermal 
cycler  Perkin-Elmer  Cetus  model  9600  as  follows:  94oC  for  3 
minutes,  35  cycles  (94o  for  30s,  annealing  temperature  for  30s, 
72o  for  30s)  72o  for  10  minutes.  PCR  products  were  denatured  at 
90oC  for  4  minutes  and  run  on  Hydrolink  MDE  gel  (AT  Biochem)  at 
constant  power  of  6W  for  16h  in  0.6xTBE.  The  gels  were  dried  and 
autoradiographed  for  12-24  hours.  The  primer  sequences  used  to 
amplify  the  exons  were  from  the  FTP  file  at  Morgan . Med . UTAH . ed , 
except  for  exons  1,  4,  6,  7  and  11,  for  which  those  primers 
published  by  Friedman  et  al  (1994)  were  used.  The  non-coding 
exon  1  and  exon  4  were  tested  by  SSCP  analysis. 


Direct  sequencing  of  PCR  fragments  was  performed  with  the 
sequenase  PCR  product  sequencing  kit,  using  the  same  primers  as 
for  PCR  amplification.  The  samples  were  migrated  on  6% 
acrylamide  gels  at  70W.  Gels  were  dried  and  exposed  to  X-ray 
film  for  12  hours. 


DNA  samples  from  one  individual  from  each  family  was  chosen 
for  sequencing.  The  chosen  individual  in  each  family  was 
observed  to  carry  the  haplotype  associated  with  cancer 
susceptibility  in  the  family. 


When  a  mutation  leading  to  a  truncated  protein  was 
identified,  the  remaining  exons  were  not  sequenced.  When  a 
missense  mutation  was  identified,  the  entire  coding  sequence  was 
analysed  to  rule  out  the  presence  of  an  additional  variant.  When 
mutations  were  found  at  splice  sites,  sequencing  of  RT-PCR 
products  was  performed  in  order  to  identify  the  consequence  of 
the  BRCAl  mutation  at  the  RNA  level.  The  splice  site  sequences 
were  screened  for  mutations  by  SSCP  and  confirmed  by  direct 
sequencing  of  PCR  fragments  amplified  from  cDNA.  Mutations  were 
confirmed  by  sequencing  DNA  from  two  other  individuals  in  the 
family  who  were  presumed  to  carry  the  mutation  as  well. 


Loss  of  transcript  corresponding  to  the  BRCAl 
susceptibility  allele  was  performed  by  comparing  the  genomic  DNA 
sequence  and  the  cDNA  sequence  for  loss  of  heterozygosity  at 
polymorphic  sites  (identified  while  sequencing  the  coding 
region) .  It  was  not  possible  to  assay  for  loss  of  transcript  in 
three  families  (family  32,  2619  and  1813)  because  no 
heterozygous  individuals  who  carried  BRCAl  mutations  were 
identified  in  these  families. 
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RESULTS 


A  total  of  16  out  of  the  20  families  studied  were  found  to 
have  BRCAl  mutations  cosegregating  with  breast  and  ovarian  cancer 
susceptibility  (table  2).  Twelve  of  these  mutations  occurred  in 
the  coding  region,  and  were  identified  through^  the  initial 
sequencing  of  genomic  DNA.  Eleven  of  these  were  either  nonsense 
or  frameshift  mutations,  and  one  was  a  missense  mutation.  Only 
one  mutation  was  seen  more  than  once;  the  deletion  of  AG 
dinucleotides  at  codon  22  was  seen  in  families  2979  and  3079. 


The  genomic  screen  also  revealed  the  presence  of  a  splice-site 
mutation  in  family  1234.  This  mutation  was  detected  in  the 
acceptor  site  of  intron  18,  and  leads  to  the  deletion  of  a  single 
nucleotide.  RT-PCR  analysis  performed  on  a  carrier  from  this 
family  revealed  that  this  mutation  causes  the  elimination  of  exon 
19  in  the  mutant  transcript,  and  premature  termination  at  codon 
1732.  The  levels  of  normal  and  mutant  BRCAl  transcripts  in  the 
lymphocytes  of  several  carriers  in  family  1234  were  studied.  The 
mutant  transcript  was  found  to  be  present  at  a  much  lower 
intensity  than  the  normal  allele  in  each  case,  and  suggests  that 
this  mutation  is  an  alteration  in  RNA  metabolism  (instability, 
transcription,  transport) .  This  BRCAl  mutation  has  not  been 
reported  previously,  but  examples  of  truncating  mutations  in 
other  genes  provoking  decreased  or  absent  mRNA  levels  have  been 
noted  (Hamosh  1991;  Baserga  1992;  Mashima  1992;  Stolzenberg  et 
al,  1994). 

The  RNA  sequence  was  examined  in  carrier  individuals  from  the 
seven  families  for  which  the  genomic  sequence  was  normal.  This 
was  done  by  performing  SSCP  analysis  on  the  RT-PCR  generated  cDNA 
product.  One  additional  mutation  was  detected  by  this  method. 
In  family  2850  cDNA  analysis  of  lymphocytes  revealed  that 
transcripts  were  missing  exon  5  in  three  carriers.  However,  no 
alteration  in  the  splice  sites  of  exon  5,  or  elsewhere  in  the 
genomic  sequence  was  identified.  This  is  similar  to  the  variant 
lacking  exon  3  in  the  cDNA  of  carriers  reported  in  one  of  the 
Berkeley  families,  where  no  genomic  change  was  identified 
(Friedman  et  al.  1994).  Exon  5  contains  three  cysteine  residues 
which  participate  in  the  formation  of  the  second  zinc  finger  of 
the  RING  domain,  and  the  absence  of  this  exon  would  be  expected 
to  have  a  significant  effect  on  the  DNA  binding  properties  of  the 
BRCAl  protein. 

Four  of  the  mutations  have  been  seen  previously  (Shattuck- 
Eidens  et  al,  1995)  including  the  mutation  in  family  1252,  in 
which  the  RING  finger  cysteine  at  position  61  is  replaced  with 
glycine  (Friedman  et  al,  1994) . 

Of  the  six  remaining  families  for  which  the  genomic  and  cDNA 
analyses  were  normal,  two  had  regulatory  mutations  inferred  from 
the  observation  of  decreased  levels  of  mRNA.  cDNAs  of  presumed 
mutation  carriers  from  families  2090  and  3173  showed  strong 
reduction  in  heterozygosity  at  the  intragenic  polymorphic  sites. 
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implying  a  greatly  reduced  BRCAl  transcript  level  (figure  1) .  No 
nucleotide  changes  in  genomic  DNA  were  revealed  in  these  two 
families.  In  the  cDNA  of  non-carriers,  both  alleles  were  found 
to  be  of  equal  intensity.  It  was  not  possible  to  perform  this 
analysis  on  families  32  and  2619  because  no  BRCAl  heterozygous 
individuals  were  identified. 

The  relative  expression  level  of  the  mutant  BRCAl  allele  was 
also  examined  for  the  other  identified  mutations,  where  possible. 
Four  mutations  were  associated  with  a  reduced  relative  amount  of 
transcripts,  as  compared  to  their  normal  counterparts  (table  2) . 


It  has  been  suggested  that  mutations  in  the  5'  end  of  the 
BRCAl  gene  may  be  associated  with  an  increased  risk  of  ovarian 
cancer  (Friedman  et  al.  1994;  Shattuck-Eidens  et  al.  1995).  We 
saw  little  evidence  to  support  this  hypothesis  in  our  families. 
In  fact,  family  1234,  which  contains  nine  cases  of  breast  cancer 
and  ten  cases  of  ovarian  cancer  carries  a  mutation  leading  to  a 
truncated  protein  of  nearly  full  length.  There  were  no 
significant  differences  in  the  age  of  onset,  or  of  the  ratio  of 
ovarian  to  breast  cancers  for  mutations  in  the  3'  versus  5'  half 
of  the  gene.  However,  the  ratio  of  cases  of  ovarian  to  breast 
cancer  was  higher  in  families  with  missing  or  reduced  BRCAl 
transcript  (0.54)  compared  to  families  with  mutations  associated 
with  equal  expression  of  normal  and  mutant  BRCAl  alleles  (0.30; 
p  =  0.14).  Among  the  eight  families  with  normal  RNA  levels, 
there  were  three  families  that  had  mutations  leading  to 
termination  at  codon  39;  the  ratio  of  ovarian  to  breast  cancers 
was  higher  in  these  three  families  (0.53)  than  in  the  other  five, 
which  carried  more  distal  mutations  (0.19;  p  =  0.16). 

It  was  originally  reported  that  families  with  cases  of  male 
breast  cancer  were  unlinked  to  BRCAl  (Stratton  et  al.  1994;  Narod 
et  al.  1995).  Family  2651  (figure  2)  is  the  third  example  of  a 
BRCAl-linked  family  which  contains  a  case  of  male  breast  cancer 
(Streuwing  et  al.  1995,  Hogervorst  et  al.  1995).  We  were  able  to 
demonstrate  the  presence  of  the  BRCAl  mutation  in  the  male  with 
breast  cancer (individual  452).  It  now  appears  that  the  risk  of 
breast  cancer  is  in  fact,  increased  in  males  carrying  BRCAl 
mutations. 

We  failed  to  find  BRCAl  mutations  in  families  1816,  32,  2619, 
and  1882.  Only  family  1816  is  clearly  linked  to  BRCAl.  The 
multipoint  lod  score  with  D17S579  and  THRAl  is  3.62.  RNA  levels 
associated  with  both  BRCAl  alleles  were  normal.  Family  32 
contains  two  cases  of  ovarian  cancer  and  10  cases  of  breast 
cancer  (figure  3).  It  originally  appeared  that  this  family  was 
linked  to  BRCAl.  However,  in  1994  individual  141  was  diagnosed 
with  breast  cancer  at  age  50.  She  does  not  share  a  BRCAl 
haplotype  of  risk  with  the  other  affected  women,  and  the  lod 
score  is  therefore  slightly  negative.  It  is  possible  that  she  is 
a  sporadic  case.  Family  2619  (figure  4)  contains  two  cases  of 
breast  cancer  of  relatively  late  onset  (mean  age  53.5  years). 
The  relative  frequency  of  ovarian  cancer  and  the  presence  of 


7 


early-onset  colon  cancer  suggests  hereditary  non-polyposis  colon 
cancer  as  an  alternate  diagnosis.  Family  1882  had  the  highest 
average  age  of  onset  of  breast  cancer  in  the  study  (55.8  years), 
and  only  a  single  case  of  ovarian  cancer.  There  was  no  evidence 
from  linkage  supporting  a  BRCAl  origin  for  this  family. 


All,  or  nearly  all,  breast-ovarian  cancer  families  are 
believed  to  be  linked  either  to  BRCAl  or  to  BRCA2.  Among  145 
breast  ovarian  cancer  families  studied  by  the  Breast  Cancer 
Linkage  Consortium,  there  was  no  clear  example  of  a  family  that 
was  unlinked  to  both  loci  (Narod  et  al,  in  press) .  It  is 
expected,  therefore,  that  breast-ovarian  cancer  families  unlinked 
to  BRCAl  should  have  a  high  probability  of  being  linked  to  BRCA2, 
as  was  found  to  be  the  case  in  the  Consortium  data  set. 
However,  the  lod  scores  for  linkage  to  BRCA2  for  families  32, 
2619  and  1882  were  -2.50,  -1.00,  and  -0.51,  respectively,  and  do 
not  suggest  BRCA2  as  the  cause  of  cancer  susceptibility  in  these 
families.  The  presence  of  undetected  BRCAl  mutations  remains  the 
more  plausible  explanations. 


DISCUSSION 


Sensitivity  of  approach  to  mutation  screening  We  utilized  a 
two-step  approach  to  BRCAl  mutation  screening  whereby  all  coding 
exons  of  BRCAl  were  sequenced  in  genomic  DNA,  and  RNA  was 
analyzed  by  SSCP  in  families  where  genomic  mutations  were  not 
seen.  With  this  approach  we  have  identified  BRCAl  mutations  in 
16  of  20  families  (80%)  with  both  breast  and  ovarian  cancer. 
Mutations  were  found  in  10  of  11  (91%)  families  with  lod  scores 
greater  than  0.50;  and  in  15  of  17  families  (88%)  where  the  mean 
age  of  diagnosis  of  breast  cancer  was  less  than  50  years. 
Mutations  were  not  found  in  four  families.  One  of  these  has 
compelling  evidence  of  being  linked  to  BRCAl.  The  other  three 
kindreds  with  no  BRCAl  mutations  were  not  clearly  linked, 
suggesting  the  possibility  of  involvement  of  another  predisposing 
gene.  One  previous  study  used  genomic  sequencing  as  a  screening 
method.  Simard  et  al.  (1994)  detected  BRCAl  mutations  in  13  of 
30  Canadian  families  with  three  or  more  cases  of  breast  or 
ovarian  cancer,  but  mutations  were  identified  in  six  of  eight 
families  (75%)  with  two  cases  of  ovarian  cancer  and  two  cases  of 
early-onset  breast  cancer. 

Three  studies  used  SSCP  as  the  primary  method  of  mutation 
screening  (Castilla  et  al.  1994;  Friedman  et  al.  1994;  Streuwing 
et  al.  1995).  Castilla  et  al.  (1994)  detected  germline  BRCAl 
mutations  in  only  eight  of  50  familial  breast  cancer  cases. 
However,  only  10  of  the  50  families  had  lod  scores  of  1.0  or 
greater  to  BRCAl.  Seven  of  the  eight  families  positive  for 
mutations  contained  cases  of  ovarian  cancer.  Friedman  et  al. 
(1994)  also  used  SSCP  and  identified  BRCAl  mutations  in  10  of  20 
breast  cancer  and  breast-ovary  families  with  "convincing  evidence 


8 


of  linkage"  (lod  scores  were  not  given) .  Streuwing  et  al. 
(1995)  used  SSCP  to  screen  for  mutations  in  24  families  with  at 
least  three  cases  of  breast  or  ovarian  cancer  (all  families 
contained  at  least  one  case  of  ovarian  cancer) .  Mutations  were 
detected  in  only  10  of  the  24  families,  but  were  found  in  all 
four  families  with  lod  scores  >  0.30. 

Hogervorst  et  al.  (1995)  screened  for  BRCAl  mutations  using  the 
protein  truncation  test  (PTT) .  Mutations  were  identified  in  only 
six  of  35  patients  from  breast  or  breast/ovarian  cancer  families 
in  this  study.  However,  mutations  were  found  in  four  of  the  five 
families  with  lod  scores  greater  than  1.0.  In  our  study,  it  could 
be  anticipated  that  PTT  would  detect  mutations  in  13  families. 
In  six  of  these  13  families  (families  4001,  2651,  2770,  1234, 
2090  and  3173)  the  expression  of  the  mutant  allele  was 
significantly  reduced.  It  is  not  yet  clear  if  this  test  would 
perform  adequately  under  these  circumstances. 


In  general,  the  mutation  yield  in  our  study  and  in  previous 
studies  is  much  higher  for  families  with  strong  evidence  of 
linkage  to  BRCAl  (e.g.  lod  scores  >  0.5) .  The  yield  is  also 
higher  in  families  with  clear-cut  evidence  of  the  breast-ovarian 
cancer  syndrome  (at  least  two  cases  of  ovarian  cancer  and  a  least 
two  cases  of  breast  cancer  before  the  age  of  60) .  It  is 
estimated  that  92%  of  families  satisfying  these  criteria  are 
attributable  to  BRCAl  mutations  (Narod  et  al.  1995) .  It  is 
difficult  to  evaluate  the  sensitivity  of  the  SSCP  and  PTT  tests 
against  the  method  of  direct  sequencing  at  present,  because  it  is 
not  clear  which  negative  tests  are  false  negatives.  It  will  be 
necessary  to  perform  the  three  tests  on  a  coimon  set  of  families 
if  comparisons  of  sensitivity  are  to  be  meaningful. 


Mutation  types  The  majority  of  mutations  in  our  study  (81%) 
resulted  in  truncated  BRCAl  proteins,  as  reported  by  Shattuck- 
Eidens  et  al.  (1995) .  However,  we  found  a  different  distribution 
of  mutation  subtypes  within  this  class,  notably  a  relative  excess 
of  nonsense  and  splice-site  mutations  (table  3) .  These 
differences  are  likely  to  be  the  result  of  the  methods  used  for 
screening  in  the  different  studies.  In  our  study,  both  genomic 
DNA  and  cDNA  were  analysed.  The  approach  of  direct  DNA 
sequencing  is  expected  to  lead  to  a  greater  rate  of  detection  of 
single  base-pair  substitutions  (such  as  nonsense  mutations)  than 
is  SSCP.  Few  previous  studies  have  looked  for  regulatory 
mutations,  or  for  other  non-coding  mutations  generating  an 
unstable  or  mutant  BRCAl  transcript.  In  addition,  our  series 
included  families  selected  for  both  breast  and  ovarian  cancer. 


Genotype / Phenoy tpe  Correlations  Overall,  we  did  not  find  a 
significant  Influence  of  the  position  of  the  BRCAl  mutation  on 
the  appearance  of  the  breast  and  ovarian  cancers  in  these 
families,  either  in  terms  of  the  ratio  of  breast  to  ovarian 
cancers,  or  in  terms  of  the  average  age  of  onset  of  these 
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cancers.  In  particular,  we  were  unable  to  confirm  the 
speculation  of  Shattuck-Eidens  et  al  (1995)  that  5'  mutations 
confer  a  greater  risk  of  ovarian  cancer.  Of  the  two  families 
with  the  greatest  number  of  ovarian  cancers  in  this  series,  one 
had  a  mutation  in  intron  18  (near  the  3'  end)  and  the  other  did 
not  have  any  mutation  identified  in  the  BRCAl  coding  region. 
Among  the  14  informative  families  with  BRCAl  mutations,  we  found 
a  reduction  in  RNA  levels  in  six  families,  and  equal  expression 
of  the  two  alleles  in  eight.  There  is  a  suggestion  that  the  risk 
of  ovarian  cancer  is  greater  in  families  with  mutations 
associated  with  reduced  RNA  levels.  However,  it  should  be 
noted  that  within  this  subgroup  was  a  family  with  3  ovarian  and 
14  breast  cancers,  and  another  with  10  ovarian  and  9  breast 
cancers.  In  the  subgroup  of  eight  families  with  normal  BRCAl 
RNA  levels,  the  penetrance  of  ovarian  cancer  was  higher  in  three 
families  with  extreme  5'  mutations  than  in  the  other  five. 
These  preliminary  observations  require  confirmation.  It  may  be 
that  other  factors,  genetic  and  non-genetic,  have  greater 
importance  in  determining  cancer  penetrance  than  the  allelic 
variation  of  the  BRCAl  gene  itself . 

Issues  in  Genetic  Counselling  It  is  possible  to  offer  predictive 
testing  in  some  families  for  the  breast~ovarian  cancer  syndrome 
using  linkage  analysis  alone  (Lynch  et  al,  1994),  but  the  process 
is  greatly  facilitated  when  direct  mutation  detection  is  also 
available.  We  have  taken  a  comprehensive  approach  to  risk 
assessment  which  includes  both  linkage  analysis  and  direct 
mutation  analysis  detection.  In  several  cases,  the  combined 
approach  has  allowed  us  to  provide  evaluations^  when  a  single 
approach  was  insufficient.  We  have  provided  risk  assessments 
based  on  DNA  analysis  to  over  200  individuals  from  10  of  these 
families.  Haplotype  and  mutation  analyses  are  done  in  different 
laboratories,  and  the  co-segregation  of  the  identified  mutation 
with  the  associated  haplotype  is  confirmed  prior  to  risk 
communication.  Although  we  were  unable  to  detect _  a  BRCAl 
mutation  in  family  1816  the  multipoint  lod  score  for  linkage  to 
chromosome  17q  markers  in  this  family  is  greater  than  3.0,  and 
the  probability  of  BRCAl  linkage  is  estimated  to  be  above  99%. 
We  have  provided  DNA-based  risk  estimates  to  over  100  individuals 
in  this  single  family  based  on  linkage  analysis  alone. 

The  linkage  results  for  family  2651  were  difficult  at  first  to 
interpret.  The  affected  persons  in  the  left  hand  branch  of  the 
pedigree  (individuals  67,  60,  107  and  120;  figure  2)  did  not 
share  an  affected  haplotype  with  the  five  affected  women  in  the 
middle  branch  (individuals  1,  6,  29,  45,  and  39).  Although  the 
segregation  of  the  (3, 5, 3, 6, 2)  haplotype  in  the  latter  five  women 
was  consistent  with  a  BRCAl  mutation  inherited  from  individual 
69,  we  were  unwilling  to  provide  risk  assesments  based  on  this 
assumption.  The  finding  of  the  5  base-pair  deletion  in  exon  11 
segregating  with  this  haplotype  confirmed  this  pattern  of 
inheritance.  The  family  history  of  individual  69  was  then 
extended,  and  additional  cases  of  cancer,  including  a  case  of 
male  breast  cancer  in  individual  452,  were  revealed.  Genetic 
risks  have  now  been  given  to  family  members  based  on  the  combined 
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mutation  and  linkage  data. 


The  principal  considerations  for  offering  predictive  testing 
are  the  specificity  and  sensitivity  of  the  test  employed.  In  the 
absence  of  strong  evidence  of  linkage,  a  negative  mutation  test 
is  not  useful;  i.e.,  it  is  not  possible  to  reassure  individuals 
that  they  are  not  at  elevated  risk,  because  of  the  possibility 
that  a  mutation  may  have  been  missed,  and  because  of  genetic 
heterogeneity.  However,  once  a  mutation  is  identified  in  an 
affected  individual  it  is  possible  to  offer  accurate  predictive 
testing  to  unaffected  women  in  a  family.  Currently,  we  feel  that 
there  is  sufficient  evidence  to  conclude  that  a  mutation  that 
leads  to  premature  protein  truncation  is  a  true  positive,  and  we 
counsel  families  accordingly.  In  contrast,  we  are  reluctant  to 
offer  counselling  based  solely  on  the  presence  of  a  missense  or 
an  inferred  regulatory  mutation,  and  we  require  confirmatory 
linkage  evidence.  Our  data  do  not  support  the  hypotheses  that 
S'  mutations  carry  a  greater  risk  of  ovarian  cancer  than  do  3' 
mutations.  Because  we  believe  there  is  no  convincing  evidence 
yet  for  genotype/ phenotype  correlations,  we  feel  that  it  is 
premature  to  modify  our  risk  estimates  based  on  the  position  of 
the  BRCAl  mutations. 
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Table  1.  Breast-Ovary  Cancer  Families  used  in  the  Analysis 


Family 

Number 

Number  of 

Breast 

Cancers 

Average 

Age  of 

Diagnosis 

Number  of 

Ovarian 

Cancers 

Average 

Age  of 

Diagnosis 

LOD 

Score 

Probability 

of  Linkage 

to  BRCAl 

32 

10 

45.2 

2 

68.0 

-0.41 

80 

1086 

11 

41.4 

1 

65 

0.38 

91 

1234 

9 

43.7 

10 

54.1 

1.00 

99 

1252 

7 

37.6 

1 

84 

0.73 

96 

1813 

5 

36.2 

1 

51 

0.54 

94 

1816 

14 

40.1 

11 

50.4 

3.62 

99 

1882 

6 

55.8 

1 

73 

-0.03 

80 

1973 

5 

35.6 

1 

53.0 

0.52 

93 

2090 

7 

32.3 

2 

41.5 

0.03 

92 

2619 

2 

53.5 

4 

59.5 

0.16 

94 

2651 

5 

45.0 

2 

52 . 0 

-0.05 

91 

2770 

14 

39.6 

3 

47.7 

1.56 

99 

2775 

10 

43.7 

1 

47 

1.46 

99 

2850 

16 

41.6 

2 

46.5 

1.26 

99 

2944 

5 

35.4 

2 

47 

0.80 

99 

2979 

2 

38.4 

2 

47 

1.15 

99 

3079 

3 

49.7 

5 

53.6 

0.45 

97 

3173 

1 

51 

4 

54.5 

0.28 

96 

3300 

4 

40.5 

1 

47 

0.50 

93 

4001 

7 

40.7 

2 

43.0 

0.30 

96 
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Footnote  to  table  1.  The  two-point  lod  scores  are  based  on 
linkage  to  the  intragenic  chromosome  17q  markers  D17S855  under  a 
model  previously  described  (Feunteun  et  al,  1993),  except  for 
family  1816  which  is  based  on  the  D17S579  and  THRAl  markers.  The 
probabilities  of  linkage  are  based  on  the  estimates  by  Narod  et  al 
(1995) ,  and  incorporate  the  lod  score  information.  Family  2651 
contains  a  case  of  male  breast  cancer. 
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Table  2.  BRCAl  Mutations  in  20  Families  Studied 


Family  Type  Exon  Codon  Nucleotide  Mutation  Loss  of 
Number  Transcript 


2979 

FS 

2 

22 

185 

del  AG  -  ter  39 

No 

3079 

FS 

2 

22 

185 

del  AG  -  ter  39 

No 

2775 

FS 

2 

23 

188 

del  11  -  ter39 

No 

1252 

MS 

5 

61 

300 

Cys-Gly  (T-G) 

No 

2850 

SP 

5 

- 

- 

exon  5  missing 

in  transcript 

1086 

FS 

11 

266 

916 

del  TT  -  ter  285 

No 

4001 

NS 

11 

526 

1695 

Gln-Ter  (C-T) 

Yes 

2944 

11 

908 

2841 

Glu-Ter  (G-T) 

No 

3300 

FS 

11 

1160 

3598 

del  11  -  ter  1166 

No 

2651 

FS 

11 

1234 

3819 

del  5  -  ter  1242 

Yes 

2770 

NS 

16 

1563 

4808 

Tyr-Ter  (C-G) 

Yes 

1234 

SP 

Intron 

- 

- 

del  A  - 

Yes 

18 

del  exon  19  - 

- 

ter  1732 

1973 

FS 

20 

1756 

5382 

ins  C  -  ter  1829 

No 

1813 

NS 

24 

1835 

5622 

Arg-Ter  (C-T) 

Not  done 

2090 

IR 

- 

- 

- 

- 

Yes 

3173 

IR 

- 

- 

- 

- 

Yes 

FS  Frameshift;  MS  Missense; 

NS  Nonsense;  IR  Inferred 

Regulatory 

Transcript 

analysis 

was  not 

done  for  family  1813  because  RNA  was 

unavailable 

from 

an  informative 

(heterozygous) 

individual 

carrying  the  BRCAl 

mutation 

in  this 

family. 
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Table  3.  Frequency  of  Different  Types  of  BRCAl  Mutations 


Type  of  Mutation 

Mutations 

Mutations 

Found  ( % ) 

Reported  (%) 

Frameshift 

7 

(43.8) 

45 

(71.4) 

Nonsense 

4 

(25.0) 

6 

(9.5) 

Splice  site 

2 

(12.5) 

2 

(3.2) 

Missense 

1 

(6.3) 

9 

(14.3) 

Inferred  Regulatory 

2 

(12.5) 

1 

(1.6) 

Total 

16 

63 

Footnote:  Mutations  reported  are  from  Shattuck-Eidens  et  al 

(1995)  . 
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LEGENDS  FOR  FIGURES 


Figure  1.  Analysis  of  the  relative  amount  of  BRCAl  transcripts 
produced  from  mutant  and  normal  alleles  in  lymphocytes  of  BRCAl 
mutation  carriers  in  family  2090.  PCR  fragment  N  of  exon  11 
(Friedman  et  al,  1994)  was  amplified  from  lymphocyte  cDNA  and 
genomic  DNA  was  sequenced  and  the  intensities  of  the  two  alleles 
A  and  G  of  the  single  base  polymorphisms  at  position  3667  was 
compared.  The  A  allele  was  found  to  segregate  with  cancer 
susceptibility  in  this  family.  1  and  2:  sequences  from  cDNA  in 
two  different  carrier  individuals;  3  and  4:  corresponding  genomic 
sequences  for  these  individuals;  5:  cDNA  sequence  of  noncarrier 
in  this  family  heterozygous  for  the  polymorphism.  Note  the 
reduced  intensity  of  the  G  allele  (arrow)  in  the  cDNA  from 
carriers,  but  not  in  the  genomic,  or  non-carrier  DNA. 


Figure  2.  Pedigree  of  family  2651.  Black  circles  indicate 
affected  women.  Diagonal  slash  indicates  deceased.  Individual 
identification  numbers  appear  directly  below  the  symbols.  Br  = 
breast  cancer,  Ov  =  ovarian  cancer,  co  =  colon  cancer,  Gb  = 
cancer  of  gall  bladder,  Csu  =  cancer,  site  unknown,  Sto  = 
stomach  cancer.  Pro  =  prostate  cancer,  En  =  endometrial  cancer, 
Bt  =  Brain  tumour,  Lu  =  lung  cancer.  The  numbers  following 
these  abbreviations  indicate  ages  of  diagnosis,  where  known.  The 
numbers  arranged  vertically  below  the  individual  symbols  indicate 
the  marker  alleles  arranged  into  haplotypes.  Haplotypes  in 
brackets  are  inferred.  Marker  alleles  separated  by  a  comma  cannot 
be  phased.  A  dash  in  the  place  of  a  marker  typing  indicates 
missing  information.  The  haplotype  in  the  vertical  rectangle  is 
that  observed  to  segregate  with  the  BRCAl  mutation.  The  presence 
of  the  mutation  is  indicated  by  an  asterisk,  the  N  refers  to  the 
absence  of  this  mutation. 


Figure  3.  Pedigree  of  family  32.  Symbols  as  in  figure  1.  Eso  = 
esophageal  cancer;  Lip  =  cancer  of  the  lip. 


Figure  4.  Pedigree  of  family  2619.  Symbols  as  in  figure  1. 


19 


ACKNOWLEDGEMENTS 


We  thank  the  family  members  and  the  physicians  who  contributed  to 
this  work.  We  thank  MF  Lavoue,  C  Bonnardel,  S  Delaloge,  T  Conway, 
J  Lynch,  P  Watson  and  S  Slominski  for  their  expert  assistance. 
This  work  was  supported  by  grants  from  the  Ligue  Nationale  Centre 
le  Cancer,  the  Canadian  Genetics  Diseases  Network  and  the 
Department  of  the  Army  (US)  Grant  number  DAMD17-94-J-4299  and  the 
National  Institutes  of  Health  grants  IROl  CA  63678-01.  OS  is  a 
fellow  of  the  fondation  Merieux.  MM  is  a  fellow  of  the 
Associazione  Italiana  per  la  Ricerca  sur  Cancer  (AIRC)  and  is  a 
recipient  of  an  lARC  special  training  award.  SN  is  a  fellow  of 
the  Fonds  de  la  Recherche  en  Sante  du  Quebec  and  PT  is  supported 
by  the  Medical  Research  Council  of  Canada. 


20 


Original  Contributions 

A  Collaborative  Survey  of  80  Mutations 
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Cathryn  Lswis.  PhO;  Mark  H,  Skolnick.  PhD:  David  E.  Goidgar.  PhD 


Objectives  —To  report  the  initial  experience  of  an  intemationai  group  of  inves¬ 
tigators  in  identifying  mutations  in  the  BRCA1  breast  and  ovanan  cancer  suscep¬ 
tibility  gene,  to  assess  the  spectrum  of  such  mutations  in  samples  from  patients  with 
different  family  histories  of  cancer,  and  to  determine  the  frequency  of  recurrent  mu- 

Desiqn. — Nine  laboratories  in  North  America  and  the  United  Kingdom  tested  for 
BRCA1  mutations  in  DNA  samples  obtained  from  a  total  of  372  unrated  patients 
with  breast  or  ovanan  cancer  largely  chosen  from  high-risk  families.  Three  of  these 
laboratories  also  analyzed  a  total  of  714  additional  samples  irom  breast  or  ovanan 
cancer  cases,  including  557  unselected  for  family  history,  for  two  speciric  mutations 
that  had  been  found  to  recur  in  familial  samples. 

Participants.— A  total  of  1086  women  with  either  breast  or  ovanan  caricer 

Main  Outcome  Measure.- The  detection  of  sequence  vanation  in  patients  DNA 

samples  that  is  not  found  in  sets  of  control  samples. 

Results. _ BRCA 1  mutations  have  now  been  identified  in  a  total  of  80  patient  sam- 

oies  Thirty-eight  distinct  mutations  were  found  among  63  mutations  identified 
through  a  complete  screen  of  the  BRCA  1  gene.  Three  specific  mutations  appeared 
relatively  common,  occurring  eight,  seven,  and  five  times,  respectively  When  speci¬ 
fic  tests  for  the  two  most  common  mutations  were  performed  in  larger  sets  of 
samples  they  were  found  in  17  additional  patients.  Mutations  predicted  to  result  in 
a  truncated  protein  accounted  for  86%  of  the  mutations  detected  by  complete 

^'^Conclusions.- The  high  frequency  of  protein-terminating  mirations  and  the 
observation  of  many  recurrent  mutations  found  in  a  diverse  set  of  samples  could 
lead  to  a  relatively  simple  diagnostic  test  for  BRCA  1  mutations.  More  data  must  be 
accumulated  to  address  specifically  the  sensitivity  and  specificity  of  such  a 
diagnostic  testing  procedure  and  to  better  estimate  the  age-specific  nsk  for  breast 
and  ovarian  cancer  associated  with  such  mutations. 


A  compieie  nsl  or  author  affiliations  aopears  at  the 
end  of  this  article.  Ors  Skolnick,  Lewis,  and  Goidgar 
have  stock  options  in  Myriad  Genetics,  which  may  of¬ 
fer  aiagnostic  testing  for  BRCA  1  in  the  future.  Or  Skol¬ 
nick  IS  vice  oresioent  of  Mynaa  Genetics. 

Reonm  reauests  to  Oeoartmentef  Medical  Informat¬ 
ics.  University  cf  Utan  School  or  Medicine.  391  Chioeta 
Way.  Suite  02.  Salt  Lake  City.  UT  34108  (Or  Goiagar). 


A  PROMISING  approach  for  reducing 
the  high  incidence  and  mortality  associ¬ 
ated  mth  breast  and  ovarian  cancer  lies 
in  the  early  detection  of  women  at  high 
risk.  These  women,  once  identified,  can 
be  targeted  for  more  aggressive  preven¬ 
tion  programs.  Because  the  strongest 


kno\\Ti  epidemiologic  risk  factor  for  breast 
cancer  is  a  positive  family  history^  and 
because  studies  of  bi*east  cancer  patients 
and  their  relatives  consistently  find  sta¬ 
tistical  evidence  for  involvement  of  an 
autosomal  dominant  gene.~"  the  identifi¬ 
cation  of  specific  genetic  effects  have  long 
been  the  focus  of  etions  to  identin^  wom¬ 
en  at  liigh  lisk.  She  months  ago,  a  specific 
gene,  BRCAl.^  which  cornel's  greatly  in¬ 
creased  susceptibility^  to  breast  and  ovai'- 
ian  cancer,  was  isolated,'^  creating  great 
interest  among  oncologists  and  among 
women  with  a  pei-sonal  or  farfy  history 
of  breast  cancer.  Tlie  focus  of  this  article 
is  on  issues  in  diagnostic  testing  spe¬ 
cifically  related  to  BRCAl,  while  the 
generi  issues  of  risk  assessment  and 
counseling -for  individuals  with  a  family 
histoiy  of  breast  cancer  ai'e  addressed 
elsewhere  in  this  issue.’* 

BRCAl  is  a  lai*ge  gene,  containing  5592 
nucleotides  spread  over  approximtely 
100  000  bases  of  genomic  DNA:  it  is  com¬ 
posed  of  22  coding  exons  producing  a  yp- 
tein  of  1863  amino  acids.  Much  of  BR  CAl 
shows  no  homology  to  other  known  genes, 
with  the  exception  of  a  126  nucleotide 
sequence  at  the  amino  terminus,  which 
©ncodes  a  RING  finger  motif  (a  conngu- 

ration  characteristic  of  the  jRLVG-l  gene*), 
a  motif  found  in  ocher  proteins  that  in¬ 
teract  with  nucleic  acid  and/or  form  pro-' 
tein  complexes.  This  may  suggest  a  role 
for  BRCAl  in  DNA  transcription. 
Mutations  in  BRCAl  result  from  dif¬ 
ferent  types  of  DNA  alteration.  Two 
classes  of  mutations  result  from  a  single 
nucleotide  substitution.  If  the  substitu¬ 
tion  changes  a  single  amino  acid  but  does 
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affect  the  remainder  of  the  protein 
/  translation,  it  is  termed  a  misseiise  mu¬ 
tation.  Nonsense  mutations  occur  when 
the  nucleotide  substitution  produces  a 
stop  codon  (TGA,  TAA,  or  TAG),  and 
translation  of  the  protein  is  terminated 
at  that  point,  Frameshift  mutations  oc¬ 
cur  when  one  or  more  nucleotides  are 
either  inserted  or  deleted.  If  the  num¬ 
ber  of  bases  inserted  or  deleted  is  not 
divisible  by  three,  a  change  in  the  read¬ 
ing  frame  alters  the  remainder  of  the 
translation  of  the  protein  being  altered; 
most  often  a  stop  signal  is  encountered 
prematurely.  Another  class  of  mutation, 
intron/exon  splice-site  mutations,  can  re¬ 
sult  from  either  single  base  changes  or 
the  insertion  or  deletion  of  one  or  more 
nucleotides  in  the  intronic  sequence. 
Splice-site  mutations  cause  abnormal 
inclusion  or  exclusion  of  DNA  in  the 
coding  sequence,  resulting  in  an  aber¬ 
rant  protein.  A  final  class  of  mutation 
occurs  when  a  mutation  in  a  gene's 
regulatory  region  causes  reduction  or 
loss  of  protein  s\mthesis  from  the  mu¬ 
tant  chromosome.  Such  regulator}''  mu¬ 
tations  usually  occur  outside  of  the  cod¬ 
ing  sequence  of  a  gene  and  can  be  either 
nucleotide  substitutions  or  insertion/de¬ 
letion  events. 

There  are  several  methods  that  can 
be  used  to  detect  mutations.  Direct  DNA 
sequencing,  either  manual  sequencing 
or  automated  fluorescent  sequencing, 
can  detect  sequence  variation.  For  a  gene 
as  large  as  BRCAl,  manual  sequencing 
is  labor  intensive,  but  tmder  optimal  con¬ 
ditions,  mutations  in  the  coding  sequence 
of  a  gene  are  rarely  missed  using  this 
method.  Another  approach  is  the  single- 
strand  conformation  polymorphism 
(SSCP)  assay.^  This  method  does  not 
detect  all  sequence  changes,  especially 
if  the  DNA  fragment  size  is  greater  than 
200  base  pairs,  but  can  be  optimized  to 
detect  most  DNA  sequence  variation. 
The  reduced  detection  sensitirity  is  a 
disadvantage,  but  the  increased  through¬ 
put  possible  with  SSCP  make  it  an  at¬ 
tractive,  viable  alternative  to  direct  se¬ 
quencing  for  mutation  detection  on  a 
research  basis.  The  fragments  that  have 
shifted  mobility  on  SSCP  gels  are  then 
sequenced  to  determine  the  exact  na¬ 
ture  of  the  DNA  sequence  variation. 
Other  approaches  based  on  detection  of 
mismatches  between  the-  two  comple- 
mentar}-  DNA  (cDNA)  strands  include 
clamped  denaturing  gel  electrophoresis*' 
and  heteroduplex  analysis.'*'  None  of  the 
methods  just  mentioned  will  detect  large 
deletions,  duplications,  or  insertions, 
which  might  affect  one  or  more  com¬ 
plete  e.xons  or  even  the  entire  gene,  nor 
will  they  detect  a  regulator}'  mutation, 
which  affects  transcription  or  transla¬ 
tion  of  the  protein.  Other  methods  that 


might  detect  these  classes  of  mutations, 
such  as  the  protein  truncation  assay," 
detect  only  specific  types  of  mutations 
and  would  not  detect  missense  muta¬ 
tions.  A  review  of  currently  available 
methods  of  detecting  DNA  sequence 
variation  can  be  found  in  a  recent  re¬ 
view  by  Grompe.^  Once  a  mutation  is 
known,  an  allele-specific  detection  ap¬ 
proach,  such  as  allele-specific  oligo¬ 
nucleotide  (ASO)  hybridization,  can  be 
used  to  screen  large  numbers  of  other 
samples  rapidly  for  that  same  mutation. 

The  utility  of  any  screening  test,  ge¬ 
netic  or  otherwise,  depends  on  three 
basic  factors:  the  prior  probability  of 
having  the  condition  of  interest,  the  sen¬ 
sitivity  of  the  test  (the  chance  that  a 
person  with  the  condition  will  have  a 
positive  test  result),  and  the  specificity 
of  the  test  (the  chance  that  a  person 
without  the  condition  will  have  a  nega¬ 
tive  test  result).  For  BRCAl,  we  must 
consider  the  probability  that  a  woman 
diagnosed  with  breast  cancer  at  a  par¬ 
ticular  age  or  with  a  specific  family  his- 
tory  harbors  a  mutation  in  BRCAl ;  that 
a  mutation  will  be  detected  using  a  given 
screening  technique;  and  that  a  benign 
sequence  variant  will  be  interpreted  as 
a  deleterious  BRCAl  mutation. 

Much  of  what  is  currently  khowm  about 
the  risks  of  cancer  due  to  BRCAl  sus¬ 
ceptibility  comes  from  analyses  of  data 
collected  as  part  of  an  international  ef¬ 
fort  to  study  breast  cancer  families  for 
linkage  to  the  BRCAl  region  of  chro¬ 
mosome  17q.'‘^'*^The  probabilities  of  vari¬ 
ous  constellations  of  personal  and  fam¬ 
ily  history  being  attributable  to  BRCAl 
can  be  calculated  from  statistical  mod¬ 
els  of  the  penetrance  of  the  BRCAl  lo¬ 
cus.  These  penetrance  estimates  have 
been  obtained  from  analyses  of  large 
numbers  of  572CAi-linked  kindreds  and 
on  analysis  of  population-based  studies 
of  familial  breast  and  ovarian  cancer. 
The  probability  of  a  woman  with  breast 
or  ovarian  cancer  bearing  a  BRCAl  mu¬ 
tation  based  on  such  modeling  is  given 
in  Table  1.  These  risks  may  be  overes¬ 
timates  since  they  depend  in  part  on 
results  obtained  from  Mgh-risk  families 
suitable  for  linkage  analysis.  Now  that 
the  BRCAl  gene  has  been  isolated,  more 
precise  estimates  of  the  proportion  of 
women  diagnosed  with  breast  and  ovar¬ 
ian  cancer  at  a  given  age  and  with  a 
given  family  history  that  is  attributable 
to  BRCAl  mutations  can  be  determined. 

In  an  analysis  of  33  families  with  at 
least  four  cases  of  breast  or  ovarian  can¬ 
cer  and  with  sufficient  linkage  evidence 
to  yield  a  posterior  probability  of  17q 
linkage  of  greater  than  0.9,  Easton  et 
aP*' found  significant  statistical  evidence 
for  heterogeneity  in  the  risk  of  ovarian 
cancer;  it  was  estimated  that  89%  of 


Table  1  .->-Esti mated  Prior  Probabilities  of  a  Par¬ 
ticular  Constellation  of  Cases  Being  Due  to  the 
BRCAl  Gene 


Criterion,  Cancer 
Diagnosed,  Age  (y) 

Prior  Probabilib/ 
Case  Has  a 
BRCAl  Mutation* 

Single  affected 

Breast  <30 

0.12 

Breast  <40 

0.06 

Breast  40-49 

0.03 

Ovanan  <50 

0.07 

Sister  pairs 

Breast  <40,  breast<40 

0.37 

Breast  40^9.  breast  40-49 

0.20 

Breast  <50.  ovanan  <50 

0.46 

Ovanan  <50,  ovarian  <50 

0.61 

Families 

Breast  only,  >3  cases 

diagnosed  <50 

0.40 

^2  breast  with  a  1  ovarian 

0.82 

s2  breast  with  >2  ovarian 

0.91 

*  Estimated  using  the  model  of  Easton  et  for  pen¬ 
etrance  of  BRCA 1  and  the  estimates  of  heterogeneity 
reported  in  Easton  et  at.” 


families  had  a  relatively  low  risk  of  ovar¬ 
ian  cancer -(26%  by  age  70  years),  and 
the  remaining  11%  of  families  had  a  much 
higher  risk  (85%).  Interestingly,  no  such 
heterogeneity  w^as  observed  in  breast 
cancer  risk.  The  presumed  explanation 
for  this  finding  is  allelic  heterogeneity: 
that  is.  different  mutations  may  corner 
inherently  different  risks  of  ovarian  can¬ 
cer.  Now  that  BRCAl  has  been  isolated, 
this  h}'pothesis  can  be  tested  directly. 
Screening  and  prevention  recommenda¬ 
tions  w’ould  var}'  according  to  the  aee- 
specific  breast  cancer  risk  and  the  ovar¬ 
ian  cancer  risk  associated  with  a  specific 
mutation. 

In  the  current  article,  we  use  data 
gathered  from  the  preliminary  experi¬ 
ence  of  many  groups  of  investigators  in 
screening  the  BRCAl  gene  to  begin  an¬ 
swering  some  of  these  questions.  These 
investigators  screened  samples  ascer¬ 
tained  from  a  wide  variety  of  sources 
and  used  a  variety  of  techniques  to  de¬ 
tect  BRCAl  mutations.  Included  in  this 
article  are  the  original  BRCAl  muta¬ 
tions  reported  by  Miki  et  al^  and  Futreal 
et  al,"  the  mutations  found  in  30  Cana¬ 
dian  families  by  Simard  et  ai,'^  the  analy¬ 
sis  of  Michigan  families  by  Castilla  et 
al,'®  and  the  analysis  of  Friedman  et  aP 
of  the  20  families  studied  by  the  Berke¬ 
ley  (Calif)  group.  In  addition,  subsequent 
mutations  by  these  and  other  groups 
are  reported  herein  for  the  first  time. 
The  samples  analyzed  range  from  those 
shown  wnth  high  probability  by  linkage 
analysis  to  segregate  BRCAl  mutations, 
to  small  families  with  two  or  three  af¬ 
fected  individuals  and  no  linkage  data, 
to  samples  from  early-onset  cases  un¬ 
selected  for  family  history.  Our  goal  in 
this  initial  review  is  to  give  an  early 
view  of  what  the  mutation  profile  might 
look  like  and  what  it  implies  for  the  com¬ 
plexity  of  an  eventual  diagnostic  test. 
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METHODS 


Nine  laboratories  in  Noith  America 
and  the  United  Kingdom  screened  a  to¬ 
tal  of  372  samples  for  BRCAl  muta¬ 
tions.  derived  from  patients  with  breast 
or  ovarian  cancer,  drawn  prmarily  from 
high-risk  families.  Three  of  these  labo¬ 
ratories  tested  for  two  recurrent  muta-  • 
dons  found  in  the  familial  samples,  us¬ 
ing  714  additional  samples  derived  from 
breast  or  ovarian  cancer  cases,  includ¬ 
ing  557  unselected  for  family  history. 

The  DN  A  samples  that  were  screened 
for  BRCAl  mutations  were  extracted 
from  blood  or  tumor  samples  from  pa¬ 
tients  with  breast  or  ovarian  cancer  (or 
known  carriers  by  haplotype  an^ysis) 
who  were  participating  in  research  stud¬ 
ies  on  the  genetics  of  bre^t  cancer.  All 
subjects  signed  appropriate  informed 
consent  forms.  Table  2  details  the  num¬ 
ber  of  samples,  ascertainment  criteria, 
and  screening  method  for  each  set  ot 
samples  screened.  Although^  the 
nal  mutations  described  by  Mild  et  ai 
were  detected  through  screening  of 
cDNA,  25  pairs  of  intronic  polymerase 
chain  reaction  primers  were  used  to  am¬ 
plify  the  complete  coding  sequence  and 
splice  junctions  from  genomic  DNA  for 
the  majority  of  the  remaining  samples. 


Updated  primer  imormadon  is  publisned 
in  Castilla  et  al''-*  and  Friedman  et  al.- 
AU  putative  mutations  were  presumeci 
to  be  resequenced  to  verify  the  original 
obseiwation.  Where  possible. 
quence  variations  were  tested  lor  co¬ 
segregation  with  breast  or  ovanan  can¬ 
cer  in  the  family.  Further  eridence  oi  a 
causal  role  of  a  sequence  variant  m  can¬ 
cer  was  provided  in  some  c^es  by  prov¬ 
ing  the  absence  of  the  putative  mutation 
in  a  set  of  control  individuals.  Screenmg 
for  specific  previously  identified  muta¬ 
tions  in  large  sets  of  selected  samples 
was  performed  using  .ASO  hybridiza¬ 
tion.  -- 

In  a  preliminary  attenipi  to  test  ii 
alterations  in  certain  domains  oiBRC^ 
were  more  likeiv  to  be  associated  with 
a  particularly  high  proportion  of  cases 
of  ovarian  cancer  in  these  families,  ive 
divided  the  samples  in  which  mutations 
were  found  as  follows; 

1.  All  proband  samples  from  families 
with  a  minimum  of  three  cases  of  ovar¬ 
ian  cancer  in  which  the  number  of  breast 
cancer  cases  was  no  more  than  twice  the 
number  of  ovarian  cancer  cases  were 
classified  as  having  a  high  proportion  ot 
ovarian  cancer  cases. 

2.  Those  samples  from  families  with 
at  least  three  cases  of  breast  cancer  and 


in  which  the  number  of  breast  cancer 
cases  was  more  than  twice  the  number 
of  ovaiian  cancer  cases  w'ere  denoted  as 
having  a  low  proportion  of  ovarian  can¬ 
cer  cases. 

3.  .All  other  samples  w'ere  classmea 
as  having  an  indeterminate  or  unknown 
propoition  of  ovarian  cancer  cases. 

These  classifications  roughly  corre¬ 
spond  to  a  classification  based  on  the 
posterior  probability  of  being  a  type  I  or 
type  II  family  in  the  model  of  Easton  et 

al.‘® 

Tests  that  the  observed  mutations 
were  distributed  uniformly  across  the 
coding  region  of  BRCAl  and  compari¬ 
sons  of  the  locations  of  mutations  ac¬ 
cording  to  the  classification  of  high  and 
low  proportion  of  ovaiian  cancer  cases 
just  described  were  performed  using  the 

one-sampleandtwo-sampleKolmogorov- 

Smimov  tests,  respectively. 


RESULTS 

We  distinguish  mutations  identified 
by  analyzing  the  entire  coding  region  of 
BRCAl  from  those  in  which  screening 
is  targeted  to  specific  regions  of  the  gene 
or  specific  mutations  via  ASO  analysis. 
Table  3  describes  each  of  the  63  mum- 
tions  found  by  screening  the 
BRCAl  coding  sequence  plus  the  il 
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Table  4.— Frequency  of  Two  Commonr  8RCA 1  Mutations  Detected  Through  Targeted  Screening  in  Defined 
Samples 


Glasgow»lt _ _ _ ^*0  _ ] _ ^ 

Glasgow-2  _ 9B _ __0 _  2 


Glasgow-3 _  100 _ Not  tested _ Z 

CRC-ovanan _ _ _ 2SQ _ Not  tested _ i 

Milan,  Italy  _  50 _ ^  ^ 

•use  indicates  University  of  Southern  California.  Los  Angeles:  MSKCC.  Memorial  Sloan-Kettenng  Cancer  Center. 
New  Yor1<.  NY;  and  CRC,  CRC  Group.  Cambridge.  England, 
t Excludes  one  family  m  which  the  5382  ins  C  was  found  by  direct  sequencing. 


intron/exon  boundaries  and  by  finding 
polymorphic  sites  in  genomic  DNA  re- 
duced  to  monomorphic  sites  in  cDNA. 
Two  relatively  common  mutations  were 
found,  and  their  frequencies  in  other 
samples  were  examined  by  ASO  analy¬ 
sis  ^Table  4).  Table  5  describes  the  dis¬ 
tribution  of  BRCAl  mutations  by  type 
of  mutation.  The  majority  of  mutations 
identified  were  fraraeshifts  that  ac¬ 
counted  for  58%  of  the  36  different  mu¬ 
tations  found  to  date  in  BRCAl  and  in 
71%  of  all  samples  shown  to  have  BR  CAl 
mutations  thi'ough  complete  screening. 
The  Figure  depicts  the  location  of  each 
observed  mutation  within  BRCAl  and 
indicates  the  frequencies  of  each  recur¬ 
rent  mutation.  Two  tests  for  a  i*andom, 
uniform  distribution  of  mutations  across 
BRCAl  were  performed,  one  on  all  63 
observed  mutations  and  one  on  the  36 
unique  mutations  obser\^ed  in  the  cod¬ 
ing  sequence.  The  first  test  showed  a 
significant  departure  from  a  uniform  dis¬ 
tribution  (P=.02),  reilecting  the  multiple 
occurrences  of  the  three  frequent  mu¬ 
tations.  However,  the  locations  of  the  36 
unique  mutations  did  not  depart  from  a 
uniform  distribution  (P>.20). 

In  an  attempt  to  identify  a  genetic 
basis  for  the  statistical  observation  of 
heterogeneity  of  ovarian  cancer  risk  de¬ 
scribed  by  Easton  et  al,‘®  we  examined 
the  distribution  across  BRCAl  of  the 
mutations  from  the  16  families  with  a 
high  proportion  of  ovarian  cancer  and 
the  31  families  with  a  relatively  low  pro¬ 
portion  of  ovarian  cancer.  Although  mu¬ 
tations  found  in  samples  from  each  of 
the  three  groups  of  families  denned  by 
the  proportion  of  affected  women  who 
had  ovarian  cancer  were  found  through¬ 
out  the  BRCAl  coding  sequence,  mu¬ 
tations  in  the  3'  portion  of  the  gene  may 
be  associated  with  families  with  a  lower 
proportion  of  ovarian  cancers  in  affected 
women.  For  example,  only  four  (25%)  of 
the  16  families  with  a  relatively  high 
proportion  of  ovarian  cancer  have  mu¬ 
tations  that  lie  in  the  3'  third  of  the 
BRCAl  gene,  while  52%  (16/'31)  of  those 
families  denoted  as  having  a  low  ovarian 


cancer  prevalence  have  mutations  that 
are  found  in  this  region  (P=.08).  To  test 
this  hypothesis  across  the  entfre  BRCAl 
sequence  we  compared  the  spatial  dis¬ 
tributions  of  the  locations  of  mutations 
found  in  the  high-proportion  families 
with  those  from  low-proportion  fami¬ 
lies  using  a  two-sample  Koiomogorov- 
Sminiov  test.  Tins  comparison  failed  to 
show'  a  significant  difference  between 
the  two  distributions  of  mutation  loca¬ 
tions  (P=.20). 

COMMENT 

Mutational  Spectra  in  BRCA1 

From  the  data  shown  in  Tables  3  and 
4.  it  is  clear  that  mutations  in  BRCAl 
have  now  been  found  in  a  diverse  set  of 
samples  by  several  different  laborato¬ 
ries  using  a  variety  of  screening  meth¬ 
ods.  Moreovei',  mutations  have  been 
found  in  many  different  regions  of  the 
gene:  phenoty’pically  severe  mutations 
have  been  found  both  in  the  extreme  5' 
end  of  BRCAl  and  in  the  extreme  3' 
ponion  of  the  gene.  One  such  mutation 
found  in  a  family  with  seven  early-onset 
breast  cancer  cases  produces  a  protein 
that  is  only  missing  the  terminal  10  amino 
acids,  indicating  that  this  region  of 
BRCAl  plays  a  role  in  normal  gene  fime- 
tion.  It  is  noteworthy  that  the  over¬ 
whelming  majority  of  alterations  in 
BRCAl  have  been  either  frameshift  or 
nonsense  mutations.  Of  the  total  of  63 
independent  individuals  with  mutations 
identified  by  complete  screening  of  the 
coding  sequence,  54  (86%)  are  frame- 
shift.  nonsense,  splice,  or  regulatory  mu¬ 
tations  (Table  5),  which  result  in  an  trun¬ 
cated  protein  product.  Although  the  ma¬ 
jority  of  mutations  in  BRCAl  identified 
to  date  are  frameshifts,  it  may  be  that 
these  mutations  are  simply  more  readily 
detectable.  When  exhaustive  searches 
for  mutations  in  samples  from  a  large 
set  of  high-risk  families  that  have  a  veiy 
high  likelihood  of  harboring  BRCAl  mu¬ 
tations  have  been  completed,  more  pre¬ 
cise  estimates  of  the  frequency  of  each 
type  of  mutation  will  be  obtained. 


Table  5. — Observed  Frequency  of  Different  Types 
of  Mutations 


Mutation 

Typo 

Distinct 

Mutations, 

No.  (%)* 

(n=38) 

All 

Mutations, 
No.  (%)t 
(n=63) 

Frameshift 

23  {61} 

45  (71) 

Nonsense 

6  (16) 

6(10) 

Missense 

7(18) 

9(14) 

Other 

2(5) 

3(5) 

•Identical  mutations  are  counted  only  once  in  this 
column. 

fEach  samole  in  which  a  mutation  has  been  identified 
is  counted  in  this  column. 


There  is  as  yet  no  apparent  clustering 
of  mutations  in  BRCAl  based  on  the 
data  depicted  in  the  Figure,  using  the 
Kolmogorov-Smimov  test  described 
herein.  Twenty-two  of  the  38  distinct 
mutations  were  found  in  exon  11:  how¬ 
ever,  because  this  e.xon  contains  approxi¬ 
mately' 60%^of  the  coding  sequence  of 
BRCAulhis  is  almost  exactly  what  we 
would  expect,  assuming  a  uniform  dis¬ 
tribution  across  BRCAl.  The  only  pos¬ 
sible  evidence  of  clusteiing  is  found  in 
the  RING  finger  motif,  which  repre¬ 
sents  2.3%  of  the  total  coding  sequence 
but  contains  foui'  different  mutt.tions 
.(11%). 

Frequency  of  Recurrent  Mutations 

BRCAl.  like  many  other  cancer  pre¬ 
disposing  genes,  is  characterized  by  some 
mutations  that  occur  relatively  fre¬ 
quently.  In  BRCAl,  to  date,  three  mu¬ 
tations  appeal*  to  be  relatively  common: 
The  6382  ins  C  mutation  in  codon  1756. 
the  186  del  AG  mutation  in  codon.  23. 
and  the  6181  del  4  mutation  in  codon 
1355  were  identified  by  dfrect  sequenc¬ 
ing  or  3SCP  in  seven  (11%),  eight  ( 13%), 
and  five  (8%)  of  the  63  probands  in 
which  mutations  have  been  identified, 
respectively.  In  addition  to  these  three 
common  mutations,  eight  additional  mu¬ 
tations  have  been  found  twice  by  a  com¬ 
plete  screen  of  the  cDN  A.  The  total  n-ac- 
tion  of  mutations  that  are  recurrent  is 
31%,  and  these  11  recurrent  mutations 
comprise  57%  of  the  63  mutations  de¬ 
tected  in  probands.  Simard  et  aP^  found 
that  the  multiple  occurrences  of  the  5382 
ins  C  and  186  del  AG  mutations  shared 
a  common  ancestral  haplotype  for  ge¬ 
netic  markers  within  or  close  to  BRCAl, 
indicating  that  either  these  mutations 
are  old  or  that  the  families  are  more 
closely  related  than  originally  believed. 
It  will  be  interesting  to  perform  similar 
haplotype  comparisons  among  cases  of 
these  mutations  from  the  additional  fami¬ 
lies  reported  herein  with  these  two  mu¬ 
tations  and  the  other  recuirent  muta¬ 
tions. 

Many  of  the  probands  screened  to  date 
for  BRCAl  mutations  were  selected  for 
having  a  high  prior  probability  of  hav- 
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BRCA1  transcript  showing  the  locations  of  mutations  shown  in  Table  3.  The  7  transcript  contains  24 
exons  (shown  as  boxes),  but  exons  1  and  4  do  not  contain  coding  sequence.  The  solid  black  bar  extending 
from  exon  2  to  exon  5  indicates  the  location  of  the  RING  finger  homology.  Frameshift  mutations  occur  at 
locations  shown  with  vertical  lines  above  the  complementary  DNA  diagram,  and  missense  mutations  are 
shown  similarly  below  the  diagram.  The  vertical  line  topped  with  an  asterisk  indicates  the  splice  mutation 
in  intron  6  that  inserts  59  base  pairs  to  exon  6  and  results  in  a  frameshift.  The  height  of  each  vertical  line 
corresponds  to  the  number  of  times  each  mutation  was  observed.  The  arrows  above  the  diagram  refer  to 
nonsense  mutations,  none  of  which  were  observed  more  than  once.  Thus,  ail  mutations  shown  above  the 
diagram  result  in  premature  termination  of  the  BRCA1  protein. 


ing  BRCAl  mutations.  Thus,  the  mu¬ 
tations  found  in  this  set  may  not  be 
representative  of  those  that  would  be 
identified  in  other  sets  of  patients.  How¬ 
ever,  the  two  most  frequent  BRCAl 
mutations  have  been  found  multiple 
times  in  targeted  screening  in  sets  of 
probands  who  were,  for  the  most  part, 
either  unselected  for  familj'’  history  or 
ascertained  with  minimal  family  histon^ 
(Table  4),  At  present,  nine  (2,5%)  of  364 
samples  from  breast  or  ovarian  cancer 
patients  screened  by  ASO  have  been 
round  to  have  the  185  del  AG  mutation 
and  another  eight  (1%)  of  814  had  the 
5882  ins  C  mutation.  It  is  important  to 
note  that  six  of  these  mutations  were 
found  in  premenopausal  breast  cancer 
cases  selected  without  regard  to  family 
history  and  that  three  mutations  were 
found  in  unselected  ovarian  cancer  cases. 
Thus,  the  mutations  that  were  common 
in  the  linked  and  otherwise  high-risk 
families  also  have  been  found  repeat¬ 
edly  in  probands  who  are  more  repre¬ 
sentative  of  a  general  clinic  population. 

Factors  Affecting  Sensitivity 

There  are  a  number  of  families  re¬ 
ported  in  which  the  genetic  linkage  evi¬ 
dence  for  involvement  of  BR  CAl  is  very 
strong  but  for  which  mutations  have  not 
been  identified.  For  example,  Michigan 
families  28  and  30  that  have  multiple 
cases  of  ovarian  cancer  and  convincing 
evidence  for  imkage^*"  fall  into  this  cat¬ 
egory",  as  do  a  number  of  families  stud¬ 
ied  by  the  Berkeley  group.  Although 
these  could  be  due  to  coding  mutations 
missed  by  SSCP,  similar  families  have 
been  found  among  ^oups  using  direct 
sequencing  (eg,  family  102  in  Simard  et 
al^**  and  kindred  1925  in  Miki  et  aF).  Such 
families  could  be  analyzed  initially  for 
the  presence  of  a  single  transcript  (in¬ 
dicating  that  on  one  chromosome,  the 


copy  of  BRCAl  is  not  being  properly 
transcribed  from  genomic  DNA  into 
messenger  RNA)  through  analysis  of 
the  polymorphisms  described  in  the  re¬ 
gion  in  cDNA  and  genomic  DNA®  or  by. 
a  functional  assay  when  available.  The 
frequency  of  these  presumed  regulatory 
mutations  lying  outside  of  the  regions 
currently  being  analyzed  (exons  and  in- 
cron/'exon  junctions)  will  affect  the  util¬ 
ity  of  any  sequence-based  test  for 
BRCAl  mutations. 

Factors  Affecting  Specificity 

In  the  preceding  paragraphs,  we  have 
focused  on  factors  affecting  the  sensi- 
ti\ity  of  an  assay  for  BRCAl  mutations, 
ie,  the  chance  that  a  true,  clinically  mean¬ 
ingful  alteration  in  the  BRCAl  gene  will 
be  detected  using  a  particular  screening 
method.  However,  we  also  must  con¬ 
sider  the  specificity  of  such  an  assay,  ie, 
the  rate  of  false  positives.  In  practical 
terms  this  usually  involves  the  proper 
interpretation  of  observed  missense  mu¬ 
tations.  How  can  one  be  certain  that  an 
observed  amino  acid  substitution  found 
in  a  patient  or  relative  of  a  patient  is 
clinically  meaningful?  Missense  muta¬ 
tions,  which  are  recurrent  in  cancer  cases 
but  absent  or  very  rare  in  the  general 
population,  are  likely  to  be  dinicaily 
meaningful;  cose^egation  of  the  mis¬ 
sense  mutation  with  the  disease  in  rela¬ 
tives  of  the  proband  provides  further 
support  that  the  missense  mutation  is 
indeed  causal.  Unfortunately,  there  may 
still  be  ambiguity  regarding  the  causal 
nature  of  missense  mutations.  This  dif¬ 
ficulty  is  exemplified  by  a  Ser  1010  Asn 
putative  missense  mutation,  which  was 
described  as  a  polymorphism  by  Cas¬ 
tilla  et  al,^^  where  it  was  found  not  to 
segregate  with  the  disease  and  was  also 
detected  in  three  of  232  control  chro¬ 
mosomes.  In  the  report  of  Friedman  et 


al,^  however,  this  alteration  did  segre¬ 
gate  with  the  disease  in  the  family  in 
which  it  was  identified  and  was  absent 
in  120  control  chromosomes.  Another 
example  is  provided  by  Utah  family 
K2039  in  which  a  missense  mutation 
changing:  an  arginine  to  a  glycine  in 
amino  acid  1347  was  detected  and  found 
to  be  absent  in  156  control  chromosomes. 
However,  this  same  variant  was  found 
in  a  patient  studied  by  the  Berkeley 
group  who  also  had  a  severe  frameshift 
mutation:  thus,  its  functional  significance 
is  questionable,  and  it  was  not  included 
as  a  mutation  in  Table  3.  A  functional 
assay,  if  available,  could  be  useful  for 
distinguishing  causal  mutations  from 
neutral  polymorphisms  in  cases  in  w-hich 
the  interpretation  of  a  missense  muta¬ 
tion  is  problematic.  The  current  indica¬ 
tions  that  a  large  fraction  of  mutations 
lead  to  truncation  of  the  BRCAl  protein 
may  allow  use  of  a  truncation  assay  for 
BRCAl  mutations.  Such  an  assay  has 
been  described  for  mutation  screening 
of  the  ARC  gene,”^  Before  the  trunca¬ 
tion  assay  is  feasible  for  large-scale 
screening  we  need  to  determine  the  ac¬ 
tual  frequency  of  timncating  mutations; 
the  current  estimate  may  be  biased  due 
to  the  ascertainment  criteria  used  in  test¬ 
ing  probands. 

Allelic  Heterogeneity 

An  important  question  with  implica¬ 
tions  for  providing  counseling  based  on 
BRCAlI  testing  is  the  existence  of  allelic 
heterogeneity.  That  is,  do  different  mu¬ 
tations,  or  classes  of  mutations,  confer 
different  age-specific  risks  of  breast  and/ 
or  ovarian  cancer.  Although  these  ques¬ 
tions  can  be  most  definitively  addressed 
through  prospective  follow-up  of  rela¬ 
tives  of  patients  with  mutations  identi¬ 
fied  thi'ough  screening  of  population- 
based  case  series,  we  have  attempted  to 
answer  one  such  question  based  on  the 
data  provided  in  this  article.  Given  the 
statistical  e\'idence  of  heterogeneity  of 
ovarian  cancer  risk  from  linkage  studies 
of  breast/ovarian  cancer  families,  we  at¬ 
tempted  to  identify  whether  mutations 
in  specific  regions  of  BRCAl  might  be 
associated  with  the  occurrence  of  a  par¬ 
ticularly  high  or  low  number  of  ovarian 
cancers  relative  to  the  number  of  breast 
cancers.  Based  on  this  analysis,  there 
jvas  not  a  statistically  significant  differ¬ 
ence  in  the  distribution^  the^C.^ 
mutational  spectra  among  low-  and  high- 
prevalence  ovarian  cancer  families.  How¬ 
ever.  there  was  some  evidence  that  mu¬ 
tations  nearer  the  3'  end  of  the  gene 
were  associated  with  families  with  a 
lower  proportion  of  ovarian  cancers  in 
affected  women.  It  is  ciear  that  much 
more  data  will  be  necessary  to  address 
this  question  with  any  degree  of  preci- 
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sion.  Mutations  identified  in  samples 
from  probands  selected  with  a  well-de¬ 
fined  mode  of  ascertainment  and  wth 
subsequent  follow-up  of  ail  mutation  car¬ 
riers  within  the  family  will  be  especially 
useful  in  this  regard. 

Interpretation  of  a  Negative 
BRCA1  Test  Result 

It  should  be  emphasized  that,  at  pres¬ 
ent,  failure  to  find  a  mutation  in  an  at- 
risk  individual  is  only  useful  clinically  if 
a  BRCAl  mutation  has  been  identified 
in  an  affected  first-degree  relative.  At- 
risk  women  who  have  not  inherited  the 
mutant  BRCAl  allele  present  in  their 
affected  relative  would  have  breast  and 
ovarian  cancer  risk  at  rates  equal  to 
chose  in  the  general  population.  Fami¬ 
lies  in  which  no  BRCAl  mutation  has 
been  identified  will  have  a  risk  that  com¬ 
bines  the  probability  of  nondetection 
’.vith  risks  based  on  empiric  data  for 
women  with  a  particular  family  histoiy, 
such  as  those  provided  by  Claus  et  al.- 
In  all  cases,  risk  calculations  also  should 
incorporate  nongenetic  risk  factor's,  par¬ 
ticularly  if  they  are  shown  to  interact 
with  the  genetic  susceptibility  conferred 
by  BRCAl.  A  more  detailed  discussion 
of  risk  assessment  for  breast  cancer  and 
associated  social  and  ethical  issues  have 
been  described^  and  are  discussed  in  an 
accompanying  article  in  this  issue.'* 

In  this  article,  we  have  deliberately 
avoided  making  any  specific  recommen¬ 
dations  as  to  the  most  effective  method 
of  screening  for  BRCAl  mutations,  and 
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Women  who  carry  mutations  in  the  BRCA1  gene  on  chromosome  17q  have  an  85% 
lifetime  risk  of  breast  cancer,  and  a  60%  risk  of  ovarian  cancer.  We  have  identified 
BRCA1  mutations  in  12  of  30  (40%)  Canadian  families  with  breast  and/or  ovarian 
cancer,  including  six  of  the  eight  families  (75%)  that  contained  two  cases  of  early-onset 
breast  cancer  and  two  cases  of  ovarian  cancer.  Six  frameshift  mutations  account  for  all 
12  mutant  alleles,  including  nucleotide  insertions  (two  mutations)  and  deletions  (four 
mutations).  Four  independent  families  carried  the  same  1  basepair  (bp)  insertion  mutation 
in  codon  1 755  and  four  other  families  shared  a  2  bp  deletion  mutation  in  codons  22-23. 
These  families  were  not  known  to  be  related,  but  haplotype  analysis  suggests  that  the 
carriers  of  each  of  these  mutations  have  common  ancestors. 


In  1990,  a  gene  for  hereditary  breast  and  ovarian  cancer, 
designated  BRCAU  was  mapped  to  chromosome  17q  by 
linkage  analysis Earlier  this  year,  it  was  identified  by 
positional  cloning^  It  was  estimated  that  92%  of  families 
which  contain  two  or  more  cases  of  early-onset  breast 
cancer,  and  two  or  more  cases  of  ovarian  cancer  carry 
mutations  in  BRCAl  (ref.  4).  Miki  and  colleagues  were 
able  to  identify  BRCAl  mutations  in  five  of  six  extended 
breast  cancer  kindreds  which  showed  linkage  to  the  BRCAl 
locust  The  BRCAl  gene  contains  22  coding  exons,  which 
are  distributed  over  approximately  100  kilobases  (kb), 
and  codes  for  a  protein  of  1 ,863  amino  acids.  The  function 
of  the  BRCAl  protein  is  not  yet  known,  but  it  is  predicted 
to  contain  a  zinc  finger  domain  which  is  characteristic  of 
proteins  which  regulate  gene  transcription. 

To  establish  the  frequency  and  spectrum  of  mutations 
in  the  BRCAl  gene  in  families  with  the  breast  and  ovary 
cancer  syndrome,  we  have  sequenced  the  BRCAl  coding 
region  in  an  index  case  from  30  breast  cancer  and  breast- 
ovarian  cancer  families  from  Canada.  We  now  describe 
six  novel  mutations  in  BRCAl  in  12  of  these  families. 
Haplotype  analysis  of  intragenic  and  flanking  markers 
provided  evidence  for  a  common  origin  for  each  of  the 
mutations  that  was  observed  in  several  families. 

Mutations  in  the  BRCAl  gene  and  haplotypes 

The  30  families  with  early-onset  breast  cancer  and/or 
ovarian  cancer  were  identified  through  the  Hereditary 
Cancer  Clinic  of  McGill  University  (Table  1 ) .  Nine  families 
had  positive  evidence  for  linkage  to  the  intragenic  D1 75855 
locus  (lod  scores  greater  than  0.25).  Six  families  were 
uninformative  for  linkage  (lod  scores  -0.04  to  0. 14).  Two 
families  gave  evidence  against  linkage  (lod  scores  less  than 


-0.8).  The  other  13  families  were  not  tested  for  linkage 
because  DNA  was  not  available  from  multiple  affected 
individuals. 

The  exons  of  the  BRCAl  coding  region  and  the  exon- 
intron  splicing  boundaries  were  sequenced  in  one  or 
more  members  of  each  family.  For  a  small  number  of 
cases  DNA  was  not  available  on  an  affected  member.  For 
these,  other  gene  carriers  were  chosen  for  sequencing; 
these  were  either  obligate  carriers,  or  were  individuals 
who  had  been  previously  shown  to  carry  the  haplotype  of 
risk.  To  study  haplotype  association,  family  members 
were  typed  using  four  microsatellite  polymorphisms 
located  within  or  near  BRCAl;  from  centromere  to 
telomere,  these  are:  D17S855,  D17S1322,  D17S1323  and 
DJ7SJ327  (refs  6,7)  (Table  2). 

Direct  sequencing  of  PGR  products  of  the  coding  region 
and  exon-intron  boundaries  of  BRCAl  revealed  the 
presence  of  mutations  in  12  of  the  30  families  studied  (Fig 
la;  Table  3).  The  mutation  designated  185delAG  results 
from  a  deletion  of  one  of  two  consecutive  AG  dinucleotides 
of  the  normal  sequence  TTA  GAG  of  codons  22-23.  This 
mutation  changes  the  reading  frame  of  the  mRNA  and 
causes  a  premature  termination  codon  at  position  39  in 
exon  3.  This  mutation  was  detected  in  index  cases  from 
four  families  ( 1 80, 235, 253  and  255)  that  were  not  known 
to  be  related  and  which  originated  from  different  areas  in 
Canada.  In  these  four  families  there  were  a  total  of  12  cases 
of  breast  cancer  and  1 1  cases  of  ovarian  cancer.  These  four 
families  appear  to  share  a  common  haplotype  (GCCE)  for 
the  four  markers  tested  (in  families  180  and  253,  the 
haplotypes  could  not  be  determined  unambiguously 
because  only  a  single  affected  individual  was  typed; 
however  the  genotypes  were  consistent  with  their  being 
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Table  1  Characteristics  of  30  cancer  families 
which  have  been  tested  for  BRCA1  mutations 


Family 

Number  of 
breast  cancers 

Number  of 
ovarian  cancers 

Lod  score  for 
linkage  to 
D17S855 

Probability  of 
linkage 
to  BRCA1  (%) 

101 

2 

2 

0.08 

95.2 

102 

5 

1 

0.68 

95.3 

113 

4 

1 

_ 

81 

121 

2 

1 

0.01 

81.3 

130 

3 

0 

-0.04 

42.7 

136 

2 

1 

_ 

81 

161 

4 

1 

- 

81 

162 

3 

1 

- 

81 

164 

6 

1 

- 

81 

166 

5 

2 

- 

81 

172 

0 

1 

- 

- 

178 

4 

1 

0.27 

88.8 

179 

2 

3 

- 

92 

180 

2 

2 

- 

92 

183 

4 

1 

0.53 

93.5 

185 

1 

3 

0.91 

98.9 

186 

3 

0 

0.06 

- 

211 

2 

1 

0.26 

88.5 

213 

3 

1 

-0.03 

79.9 

218 

5 

1 

0.33 

90.0 

235 

4 

2 

-0.82 

63.4 

240 

0 

3 

0.27 

95.5 

247 

1 

3 

- 

92 

248 

3 

1 

- 

81 

253 

1 

3 

- 

92 

254 

2 

2 

-1.05 

50.6 

255 

6 

7 

1.05 

99.2 

259 

2 

1 

- 

81 

270 

4 

3 

0.69 

98.3 

279 

4 

0 

0.14 

53.0 

Lod  scores  for  linkage  between  BRCA1  and  the  intragenic  marker,  D17S855, 
were  computed  using  the  penetrances  as  described  previously^°.  The 
probabilities  of  linkage  are  estimated  by  incorporating  the  lod  score  and  the 
following  prior  probability  of  linkage:  45%  for  families  with  no  case  of  ovarian 
cancer;  81  %  for  families  with  one  case  of  ovarian  cancer;  and  92%  for  families 
with  two  or  more  cases  of  ovarian  cancer. 


Table  2  Definitions  of  alleles  with  sizes  and  frequencies 


Locus 

D17S855 

D17S1322 

D17S1323 

D17S1327 

probe 

{AFM248yg9) 

(s754) 

(s975) 

(ED2) 

Allele 

size 

freq.  (%) 

size 

freq.  (%) 

size  freq.  (%) 

size 

(n) 

A 

158 

- 

134 

4 

161 

2 

173 

B 

156 

5 

131 

3 

159 

4 

171 

C 

154 

19 

128 

11 

157 

19 

169 

D 

152 

18 

125 

25 

155 

3 

167 

(1) 

E 

150 

15 

122 

51 

153 

4 

165 

(2) 

F 

148 

12 

119 

4 

151 

68 

163 

(1) 

G 

146 

26 

116 

1 

149 

- 

161 

(5) 

H 

144 

5 

147 

- 

159 

1 

142 

- 

139 

_ 

157 

J 

155 

K 

153 

L 

135 

M 

133 

(6) 

N 

131 

0 

129 

Allele  sizes  are  in  basepairs.  Allele  frequencies  are  derived  from  ref.  13 
(D17S855)  and  from  GDB.  Newly  identified  alleles.  For  the  D17S1327 
polymorphism,  the  numbers  of  observed  alleles  of  particular  size  were  based 
on  15  control  chromosomes.  Allele  M  is  the  most  frequent  in  the  present  study 
and  in  independnet  individuals  from  CEPH  families  (0.63). 


carriers  of  the  at- risk  haplo  type).  This 
haplotype  was  not  present  in  any  of 
the  26  index  cases  who  did  not  carry 
this  particular  mutation  (Fisher's 
exact  best,  p<0.0001). 

Four  different  frameshift  mutations 
were  detected  in  exon  11.  The 
mutation  designated  1128insA  was 
detected  in  the  index  case  of  family 
270.  This  mutation  consists  of  an 
insertion  of  a  single  A  in  the  normal 
sequence  GAA  AAA  AAG  of  codons 
337-339.  This  mutation  leads  to  a 
premature  termination  codon  at 
position  345.  The  mutation 
designated  1293del40  was  found  in 
family  185  (Fig,  1^).  It  is  a  40  bp 
deletion  that  includes  codons  392- 
404  and  the  first  nucleotide  of  codon 
405.  This  mutation  leads  to  a  stop 
codon,  5  codons  distal  to  the  deletion 
and  a  predicted  truncated  BRCAl 
protein  of  396  amino  acids  in  length. 
In  family  218,  the  mutation 
designated  3121delA  was  detected 
(Fig.  1^).  This  mutation  is  the  result 
of  the  deletion  of  a  single  A  in  the 
normal  sequence  GAA  AAC  of  codons 
1001-1002  and  leads  to  a  premature 
termination  codon  at  position  1023. 
The  fourth  exon  11  mutation, 
designated  4184del4  was  found  in 
family  183  (Fig.  1  h).  This  is  a  deletion 
of  the  last  nucleotide  of  codon  1355 
and  all  of  codon  1356  (normal 
sequence  AAT  GAA),  and  produces  a 
termination  codon  at  position  1364. 
In  the  four  families  with  exon  11 
mutations,  there  were  14  cases  of 
breast  cancer  and  eight  cases  of 
ovarian  cancer. 

Four  other  unrelated  families  (101, 
162, 166  and  279)  shared  a  mutation 
designated  5382insC  in  exon  20, 
consisting  of  the  insertion  of  a  single 
C  in  codon  1755.  This  change  results 
in  a  stop  codon  at  position  1829  in 
exon  24.  One  of  these,  family  1 66,  was 
remarkable  because  ten  cases  of  cancer 
appeared  in  a  single  large  sibship, 
including  three  cases  ofbreast  cancer, 
two  ovarian  cancers,  two  leukaemias, 
two  pancreatic  cancers  and  one 
prostate  cancer  (Fig.  2a).  A  case  of 
leukaemia  and  a  case  of  Hodgkin’s 
disease  were  seen  in  more  recent 
generations.  Family  279  was  also 
unusual  —  one  obligate  carrier  was 
diagnosed  with  cancer  of  the  breast 
and  of  the  fallopian  tube,  and  another 
was  diagnosed  with  cancer  of  the 
prostate  (Fig.  2^).  In  the  four  families 
with  the  5382insC  mutation,  there 
were  1 4  cases  ofbreast  cancer  and  five 
cases  of  ovarian  cancer.  The  identical 
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Fig.  1  Mutational 
analysis  of  BRCA1 
in  breast  and 
ovarian  cancer 
families,  a, 
Schematic 
representation  of  the 
BRCA1  gene 
structure.  The  six 
frameshift  mutations 
and  the 
corresponding 
families  are 
indicated  below  the 
gene,  b,  Partial 
representative 
nucleotide  sequence 
and  corresponding 
amino  acids  of 
sense  strand  of 
BRCA1  exons, 
depicting  the 
mutations 
185delAG, 
1293del40, 
3121delA.  4184del4 
and  5382insC.  The 
sequences  of  the 
normal  alleles  for  the 
corresponding 
region  are  indicated 
on  the  right  of  each 
panel. 


mutation  was  previously  identified  in  Utah  kindred  1910 
(ref.  3).  The  four  Canadian  families  share  the  haplotype 
DEFO  (the  markers  in  family  1 66  could  not  be  phased,  but 
the  genotype  was  consistent  with  carrying  the  at-risk 
haplotype).  The  DEFO  haplotype  did  not  appear  in  any  of 
the  26  index  cases  who  didnot  carry  the  5382insC  (Fisher's 
exact  test,  p<0.001).  In  fact,  the  rare  O  allele  of  the 
D17SJ327  polymorphism  (129  bp)  was  not  observed  on 
any  of  the  52  chromosomes  from  index  cases  who  did  not 
have  this  particular  BRCAl  mutation  nor  on  80 


chromosomes  from  independent  individuals  of  CEPH 
families. 

Two  families  appeared  not  to  be  linked  to  chromosome 
17q,  but  were  investigated  because  each  contained  two 
cases  of  ovarian  cancer,  and  it  is  has  been  previously 
estimated  that  92%  of  families  of  this  type  will  carry 
BRCAl  mutations^.  In  one  of  these  (family  235),  a 
mutation  was  found  in  a  woman  with  ovarian  cancer  (Fig, 
2c).  Fler  sister  was  diagnosed  with  breast  cancer  at  age  47 
and  does  not  carry  the  185deIAG  mutation. 
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Discussion 

We  describe  six  different  mutations  in  the  BRCAl  gene  in 
a  total  of  12  breast-ovarian  cancer  families  out  of  30 
families  studied  (40%),  confirming  that  this  gene  is 
responsible  for  the  hereditary  breast  and  ovarian  cancer 
syndrome  which  has  been  mapped  to  chromosome  17q. 
Of  the  six  mutations,  two  were  each  seen  four  times  and 
four  were  unique.  One  of  the  mutations,  5382insC,  found 
in  four  Canadian  families,  was  the  first  identified  in  a  Utah 
kindredl  These  authors  also  identified  a  deletion  mutation, 
1 89del  1 1 ,  in  a  Utah  kindred.  W e  observed  a  nearby  deletion 
mutation,  185delAG,  in  four  unrelated  patients.  Both  of 
these  mutations  in  exon  2  are  predicted  to  cause  truncation 
at  the  beginning  of  the  zinc  finger  domain  of  the  putative 
polypeptide. 

In  general,  the  families  in  which  mutations  were  detected 
were  clear  examples  of  the  breast  ovarian-cancer  syndrome. 
Eight  of  the  1 2  families  with  mutations  contained  multiple 
cases  of  ovarian  cancer,  in  contrast  to  only  four  of  the  1 8 
families  in  which  mutations  were  not  found  (Fisher's 
exact  test  p=0.024).  Therefore,  it  was  possible  to  detect  a 
mutation  in  eight  of  the  12  families  (67%)  with  two  or 


more  cases  of  ovarian  cancer,  and  in  six  of  the  eight 
families  (75%)  with  two  cases  of  early-onset  breast  cancer 
and  two  cases  of  ovarian  cancer.  The  average  age  of  breast 
cancer  in  the  families  with  a  detectable  mutation  was  43.8 
years,  versus  55.5  years  for  the  1 8  families  in  which  we  did 
not  find  mutations  (Fig.  3;  Wilcoxon  test  of  the  difference 
in  the  age  distributions,  p< 0.0001). 

We  were  able  to  identify  mutations  in  five  of  nine 
families  with  lod  scores  greater  than  0.25  to  D17S855.  It 
was  not  possible  to  identify  linkage  in  all  families  with 
mutations;  of  the  12  families  with  mutations,  linkage 
analysis  was  not  feasible  in  four  because  we  were  unable  to 
obtain  the  necessary  samples,  two  other  families  were 
uninformative  (lods  of  0.08  and  0.14)  and  one  family  gave 
a  negative  lod  score  (lod— 0.82).  For  five  of  the  12  families 
with  mutations,  we  had  positive  evidence  for  linkage  (lods 
ranging  from  0.33-1.05).  Currently,  linkage  analysis  and 
direct  mutation  analysis  are  complementary,  techniques 
that  should  be  available  where  genetic  assessment  is  offered 
to  high-risk  families. 

The  nature  of  the  mutations  in  our  other  families  is  not 
yet  known.  It  is  possible  that  there  are  BRCAl  mutations 


Table  3  BRCA1  mutations  and  corresponding  haplotypes 


Family 

Mutation 

S855 

Haplotype 

S1322  SI  323 

SI  327 

101 

5382insC 

D 

E 

F 

O 

102 

- 

C 

E 

F 

M 

113 

_ 

B/E 

D/E 

F 

M 

130 

- 

C 

E 

F 

M 

121 

- 

F/H 

D 

C 

G/H 

136 

- 

F/G 

C/D 

F/G 

B/l 

161 

- 

F 

C/E 

D/F 

B/M 

162 

53821 nsC 

D 

E 

F 

O/M 

164 

- 

D/G 

B/F 

C/E 

E/G 

166 

5382insC 

D/G 

C/E 

C/F 

F/0 

172 

_ 

C/D 

D/E 

E/F 

M 

178 

- 

E 

C 

F 

J 

179 

- 

F/G 

D/E 

B/F 

G/M 

180 

185delAG 

G/H 

C/E 

C/F 

E/L 

183 

4184del4 

A 

F 

F 

M 

185 

1293del40 

F 

D 

F 

M 

186 

- 

G 

D 

C 

C 

211 

E 

C 

C 

E 

213 

- 

D 

E 

E 

M 

218 

31 21  del  A 

E 

E 

F 

M 

235 

185delAG 

G 

C 

C 

E 

240 

- 

E 

E 

F 

M 

247 

_ 

F 

D/E 

F 

M 

248 

- 

A/G 

D 

C/F 

D/M 

253 

185delAG 

G/H 

C/D 

B/C 

E/F 

254 

- 

D/H 

D 

C/F 

M/C 

255 

185delAG 

G 

C 

C 

E 

259 

C/F 

E/G 

E/F 

M 

270 

1128insA 

G 

D 

B 

D/1 

279 

5382insC 

D 

E 

F 

O 

No  mutation  identified.  Marker  alleles  correspond  to  those  in  Table  2.  Both  alleles  are  given  for 
markers  which  could  not  be  phased  with  respect  to  BRCA1. 
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outside  of  the  coding  region  which  may  result  in  an 
unstable  transcript^  It  is  also  possible  that  some  of  the 
family  clusters  are  not  due  to  mutations  in  BRCAL 
There  are  now  clear  examples  of  breast  and  ovarian 
cancer  families  which  are  unlinked  to  BRCAl;  a  small 
number  of  these  show  linkage  to  BRCA2  on 
chromosome  13  (ref.  8).  Another  possibility  is  that  we 
indavertently  chose  to  sequence  a  sporadic  case  in  a 
BRCAl -linked  family. 

If  BRCAl  protein  acts  as  a  tumour  suppressor,  we 
would  expect  a  large  proportion  of  the  mutations  to 
result  in  the  inactivation  of  one  allele  through  premature 
stop  codons.  In  our  families,  all  of  the  12  mutations  result 
in  frameshifts  that  lead  to  premature  stop  codons,  and  as 
predicted  from  the  tumour  suppressor  model,  the 
mutations  are  distributed  throughout  the  coding  region. 
Surprisingly,  somatic  mutations  in  BRCAl  have  not  yet 
been  found  in  tumour  tissue  from  women  with 
hereditary  or  non-familial  cancers^. 


There  were  two  recurring  mutations  among  the  six 
identified,  which  together  account  for  two-thirds  of  the 
observed  mutant  alleles.  One  ofthese,  5382insC,  was  strongly 
associated  with  a  rare  allele  ofthe  Di  7Si327polymorphism. 
A  wide  spectrum  of  cancers  was  observed  in  families  with 
this  mutation,  including  cancer  of  the  fallopian  tube, 
pancreas,  prostate  and  leukaemia  and  Hodgkin’s  disease.  It 
will  be  of  interest  to  see  if  these  observations  are  confirmed 
in  other  families  with  this  mutation. 

Our  data  suggest  that  screening  for  BRCAl  mutations  in 
high-risk  women  may  be  facilitated  if  common  mutations 
are  sought  first,  and  if  information  on  linkage  disequilibrium 
is  incorporated.  A  relatively  small  number  of  mutant  alleles 
may  account  for  the  majority  of  breast  and  ovarian  cancer 
families  in  Canada. 

Note  added  in  proof:  A  missense  mutation  was  identified  in 
family  179.  The  mutation  is  a  Thr  to  Lys  substitution  at 
codon  826  (ACA-^  AAA) .  It  was  not  detected  in  86  control 
individuals 


Fig.  2  Pedigrees 
of  three  breast 
cancer  families. 
a,  Family  166.  b, 
Family  279.  c. 
Family  235. 


a 

Family  166 


b 

Family  279  ^ 

Family  235 
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Table  4  Oligonucleotide  primers  for  BRCA1 

Exon  #  5'  primer  3'  primer 

2  GMGTTGTCATTTTATAAACCTTT  TGTCTTTTCTTCCCTAGTATGT 

3  CTCAGTTCCTGACACAGCAGAC  GGAGTTGGATTTTCGTTCTCACTTA 

4  GTCAAAGAGATAGAATGTGAGC  CCCGTCTCTACAGAAAACAC 

5  GGCTCTTAAGGGCAGTTGTGAG  CTTTTCCTACTGTGGTTGCTTCC 

6  AAGGTTGATAATCACTTGCTG  GTAATGTGCAAACTTCCTG 

7  GCATACATAGGGTTTCTCTTGG  TCGGGTTCACTCTGTAGAAG 

8  CATGTTAGCTGACTGATGATGG  GGAATCCAGCAATTATTATTAAATAC 

9  TACCTGCCACAGTAGATGCTCAG  AATAGGAAAATACCAGCTTCATAGA 

1 0  TAAGATTGGTCAGCTTTCTGTAATC  ACAGTACTGTATCTACCCACTCTC 

1 1  a  CTAAGTGTTCAAATACCAGTGAAC  ACTGGAGCCCACTTCATTAGTAC 

1 1  b  GATTTCCACCTCCAAGGTGTATGA  TGTGGCTCAGAAACAAATGCTCC 

1 1  c  CAGAGTTCACTCCAAATCAGTAGAG  AG  CTGAGTGCCATAATCAGTACCAGG 

1 1  d  CCAAGTACAGTGAGCACAATTA  AAATGTGCTCCCAAAAGCATAAA 

1 2  CCAGTCCTGCCAATGAGAAGAAA  TGTCAGCAAACCTAAGAATGT 

1 3  CATTTAATGGAAAGCTTCTCAAAG  AAATGTTGGAGCTAGGTCCTTAC 

1 4  ATTCTAACCTGAATTATCACTATCA  AAAGTGTATAAATGCCTGTATGCAAA 

1 5  TGGCTGCCCAGGAAGTATG  AACCAGAATATCTTTATGTAGGA 

1 6  AATTCTTAAC  AGAG  ACC  AGAAC  AAAACTCTTTCCAGAATGTTGT 

1 7  GTGTAGAACGTGCAGGATTG  TCGCCTCATGTGGTTTTA 

1 8  GAGGCTCTTTAGCTTCTTAGG  AAAGAGACCCATTTTCCCAGCA 

1 9  CTGTCATTCTTCCTGTGCTC  CATTGTTAAGGAAAGTGGTGC 

20  ATATGACGTGTCTGCTCCAC  GGGAATCCAAATTACACAGC 

2 1  AAGCTCTTCCTTTTTGAAAGTC  GTAG  AGAAATAG  AATAGCCTCT 

22  TCCCATTGAGAGGTCTTGCT  GAGAAGACTTCTGAGGCTAC 

23  CAGAGCAAGACCCTGTCTC  ACTGTGCTACTCAAGCACCA 

24  ATGAATTGACACTAATCTCTGC  GTAGCCAGGACAGTAGAAGGA 


Methodology 

Families.  The  30  families  analysed  in  this  study  were  referred  to  the 
McGill  University  Hereditary  Cancer  Clinic  at  the  Montreal  General 
Hospital  for  genetic  counselling  and  for  risk  assessment.  All  30 
families  are  Caucasian  and  principally  reside  in  Canada.  Sixteen 
families  were  previously  studied  for  linkage  to  BRCAl  (ref.  5).  In 
that  study,  it  was  estimated  that  94%  of  families  with  early-onset 
breast  cancer  and  ovarian  cancer  were  linked  families.  All  but  three 
families  contained  cases  of  ovarian  cancer,  and  all  but  two  contained 
cases  of  early-onset  breast  cancer.  Three  families  were  included 
because  they  contained  a  case  of  primary  cancer  of  the  peritoneum 
(families  130  and  172)  or  of  the  fallopian  tube  (family  279).  These 
cancer  sites  bear  a  histologic  relationship  to  epithelial  cancers  of 
the  ovary.  Family  186  was  studied  because  one  of  the  women  had 
multiple  primary  cancer  of  the  breast  and  endometrium,  and  the 
17q  haplotype  was  consistent  with  linkage  to  BRCAl  (data  not 
shown). 


step  was  considered  to  be  the  time  the  thermal  cycler  reached  94  °C 
from  the  annealing  step.  After  35  cycles,  a  final  extension  at  72  °C  for 
2  min  was  performed.  The  primers  were  removed  by  selective 
precipitation'-  and  the  PGR  products  were  then  subjected  to  a  35- 
cycle  asymmetric  amplification  under  the  same  conditions,  except 
for  the  use  of  1 0  nM  limiting  primer.  The  asymmetric  PCR  products 
were  then  purified  by  selective  precipitation. 

Single-stranded  DNA  produced  by  asymmetric  PCR  was 
sequenced  by  the  dideoxy  method  using  the  PCR  primer  or 
sequence-specific  primers  with  the  T7  sequencing  kit  (Pharmacia 
LKB  Biotechnologies)  as  described'^  Briefly,  the  single-stranded 
DNA  and  the  primers  were  mixed  together  in  the  annealing  buffer 
and  denatured  at  95  ‘'’C  for  5  min.  The  samples  were  then  quickly 
frozen  on  dry  ice-ethanol  and  the  sequencing  reactions  were 
performed  following  the  protocol  provided  by  the  manufacturer 
using  ^^S-dATP  or  ^^S-dCTP  as  the  labelled  nucleotide.  The  BRCAl 
coding  region  was  sequenced  entirely  for  each  family  except  for 


Linkage  analysis.  In  order  to  establish  probabilites  of  linkage,  lod 
scores  of  linkage  of  predisposition  to  breast  cancer  and/or  ovarian 
cancer  to  D17S855  were  computed  using  a  previously  described 
model*”.  The  distance  between  D17S855 ^nd  BRCAl  was  assumed 
to  be  0  cM  (ref.  3).  The  prior  probabilities  of  linkage  used  in  the 
calculation  were  45%,  81%  and  92%  for  families  with  0,  1  and  2 
cases  of  ovarian  cancer,  respectively^-". 

Mutation  analysis.  Genomic  DNA  was  isolated  from  blood  samples. 
DNA  from  a  representative  member  of  each  family  was  amplified  by 
PCR  using  25  pairs  of  intron-based  primers.  In  most  cases,  the 
individual  selected  was  a  woman  who  had  been  affected  with  early- 
onset  breast  cancer  or  with  ovarian  cancer.  In  some  families  for 
which  linkage  had  been  established,  genomic  sequencing  was 
performed  on  DNA  from  an  unaffected  individual  who  had  been 
shown  to  carry  the  BRCAJ  haplotype  of  risk.  Ten  families  (101, 102, 
113, 121, 136, 161, 166, 178, 179and270)  were  analysed  by  the  Utah 
group  by  DNA  sequencing  as  previously  described^  The  other  20 
families  were  analysed  in  Quebec  City.  PCR  was  performed  using  the 
26  pairs  of  primers  presented  in  Table  4,  in  a  50  pi  volume  containing 
10  mM  Tris-HCL,  pH  8.3,  50  mM  KCl,  1.5  mM  MgCl_„  50  pM  of 
dNTPs,  0.5  pM  of  each  primer,  0.1%  Triton  XlOO,  2%  DMSO  and 
100  ng  of  genomic  DNA.  The  reaction  was  first  heated  at  96  °C  for  5 
min  during  which  time  1  U  of  Taq  polymerase  (Perkin-Elmer)  was 
added.  The  reactions  were  carried  out  using  a  Perkin-Elmer  Cetus 
thermal  cyder  with  a  2 -step  temperature  cycle  consisting  of  40  s  of 
denaturation  at  94  °C  and  30  s  of  annealing  at  55  °C;  the  elongation 


21-  31-  41-  51-  61-  71- 

30  40  50  60  70  80 


Age  of  onset  (years) 

Fig.  3  Distribution  of  ages  of  onset  of  breast  cancer  in 
families  with  and  without  detectable  BRCA1  mutations,  a, 
Individuals  from  families  with  a  BRCA1  mutation  {n=35).  b. 
Individuals  from  families  without  a  BRCAl  mutation  (r?=41). 
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A  gene  for  hereditary  breast  and  ovarian  cancer, 
BRCA1,  has  been  mapped  to  chromosome  17q12  -q21. 
This  gene  is  responsibie  for  cancer  susceptibiiity  in  the 
majority  of  famiiies  with  muitipie  cases  of  ovarian 
cancer  and  eariy-onset  breast  cancer.  We  report  iinkage 
resuits  of  a  famiiy  with  10  cases  of  breast  cancer  and 
a  singie  case  of  ovarian  cancer.  A  recombinant  event 
in  this  famiiy  piaces  BRCA1  distai  (telomeric)  to  the 
iocus  EDH17B2,  which  codes  for  the  enzyme  estradioi 
17/3-dehydrogenase  ii.  This  recombinant  is  based  on 
the  appearance  of  breast  cancer  in  a  45  year  oid 
woman.  Under  our  genetic  modei,  we  estimate  the 
probabiiity  that  this  woman  carries  a  BRCA1  mutation 
to  be  94%.  These  data  further  reduce  the  region  of 
assignment  of  BRCA1  on  chromosome  17q12  -q21  and 
shouid  expedite  positionai  cioning  of  this  important 
gene. 

INTRODUCTiON 

Susceptibility  to  hereditary  breast  and  ovarian  cancer  has  been 
mapped  by  genetic  linkage  to  a  locus,  designated  BRCAl,  on 
chromosome  17ql2— q21  (1,2).  BRCAl  has  been  assigned  to 
the  13  cM  interval  between  the  loci  D17S250  and  D17S588  by 
a  consortium  analysis  of  214  cancer  families  (3).  Subsequently 
identified  genetic  recombinants  have  narrowed  the  centromeric 
boundary  of  BRCAl  successively  to  THRAl  (4),  to  RARA  (5), 
to  D17S857  (6),  to  D17S776  (7)  and  to  D17S702  (8)  (the  relative 
position  of  D17S776  and  D17S702  is  not  known).  The  telomeric 
boundary  is  currently  defined  at  D17S78  (5).  The  distance 
between  D17S776  and  D17S78  is  estimated  to  be  1-2  cM. 

This  chromosome  17  interval  contains  the  candidate  gene 
EDH17B2,  which  codes  for  17/3  hydroxysteroid  dehydrogenase 
(17/3  HSD  II).  17/3  HSD  II  is  responsible  for  the  reversible 
conversion  of  estrone,  a  weak  estrogen,  into  the  more  potent  17/3 
estradiol  and  has  been  considered  to  be  a  good  candidate  gene 
for  BRCAl.  However,  we  and  others  have  sequenced  DNA  from 
several  unrelated  BRCAl  carriers  and  have  failed  to  show 
mutations  in  this  gene  which  would  implicate  it  as  the 
susceptibility  gene  of  the  cancer  syndrome  (5,6).  EDH17B1  has 


a  similar  genomic  structure  to  EDH17B2,  but  data  suggest  that 
EDH17B1  is  a  pseudogene  (9).  To  date,  no  genetic  recombinants 
have  been  reported  between  the  loci  BRCAl  and  EDH17B2. 

RESULTS 

We  report  here  haplotype  analysis  on  breast— ovarian  cancer 
Family  2775  from  the  Creighton  University  Family  Cancer 
Registry.  This  analysis  appears  to  exclude  EDH17B2  from  the 
chromosomal  region  of  assignment  of  BRCAl.  The  three-point 
lod  score  for  this  family  was  1 .55  using  the  closely-linked  markers 
D17S855  and  D17S859.  This  lod  score  was  calculated  at 
D17S855,  assuming  a  2  cM  (sex-averaged)  distance  between 
D17S855  and  D17S859,  under  our  previously  described  linkage 
model  (10).  Based  on  a  consortium  analysis  of  48  families,  it 
has  been  estimated  that  79%  of  families  with  multiple  cases  of 
eariy-onset  breast  cancer  and  a  single  case  of  ovarian  cancer  are 
linked  to  BRCAl  (D.Ford,  personal  communication).  Given  the 
positive  lod  score,  the  probability  that  family  2775  is  linked  to 
BRCAl  is  therefore  estimated  to  be  above  99%.  A  common 
chromosome  17q  haplotype  from  D17S250  to  D17S588  is  shared 
by  seven  of  the  eight  women  with  breast  cancer  in  this  pedigree 
who  have  been  typed  with  these  markers  (Fig.  1).  Individual  128 
was  diagnosed  with  invasive  breast  cancer  at  age  45.  She  inherited 
the  marker  allele  for  the  D17S855  polymorphism  associated  with 
high  cancer  risk  from  her  father,  as  weU  as  several  other  telomeric 
markers  associated  with  increased  risk.  However,  she  inherited 
allele  2  of  the  HSD.E  polymorphism,  and  allele  1  of  the  HSD.del 
polymorphism  from  her  father.  Elsewhere  in  the  pedigree  HSD.E 
allele  1  and  HSD.del  allele  2  are  coupled  with  the  BRCAl 
mutation.  It  appears,  therefore,  that  either  individual  87  or 
individual  126  have  inherited  a  recombinant  chromosome  17 
(presumably  from  their  affected  mother).  Individual  128  shares 
the  chromosome  17  markers  distal  to,  but  not  including,  the 
EDH17B2  locus,  with  the  affected  descendants  of  individual  87. 
If  we  infer  from  her  eariy-onset  breast  cancer  that  she  inherited 
the  BRCAl  mutation  as  well,  then  this  recombinant  places 
BRCAl  telomeric  to  EDH17B2.  Further  analyses  exclude  the 
centromeric  markers  D17S857  and  D17S846  as  well.  The 
telomeric  markers  D17S855,  D17S579,  D17S509,  D17S293 
were  also  informative  but  not  excluded.  Two  other  women  in 
the  family  (individuals  143  and  12)  have  also  inherited  the 


*To  whom  correspondence  should  be  addressed 
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Figure  1.  Pedigree  of  family  2775.  Black  circles  indicate  affected  women.  Diagonal  slash  indicates  deceased.  Individual  identification  numbers  appear  directly  below 
the  symbols.  Br  =  breast  cancer,  Ov  =  ovarian  cancer,  Retr  =  retroperitoneal  cancer.  Pro  =  prostate  cancer,  Co  =  colon  cancer,  BCC  =  basal  cell  carcinoma. 
The  numbers  following  these  abbreviations  indicate  ages  of  diagnosis.  The  numbers  arranged  vertically  below  the  individual  symbols  indicate  the  marker  alleles 
arranged  into  haplotypes.  Haplotypes  in  brackets  are  inferred.  Marker  alleles  separated  by  a  comma  cannot  be  phased.  A  dash  in  the  place  of  a  marker  typing  indicates 
missing  information.  Vertical  rectangles  are  used  to  indicate  the  haplotype  found  to  be  associated  with  breast  or  ovarian  cancer  in  the  family. 

haplotype  of  risk  and  have  not  developed  breast  or  ovarian  cancer.  DISCUSSION 

Individual  143  underwent  a  bilateral  oophorectomy  at  age  48  (her  The  recombinant  in  Family  2775  suggests  that  BRCA 1  is  located 

ovaries  were  described  as  normal).  She  is  currently  52  years  old.  telomeric  to  EDH17B2  on  chromosome  17ql2-q21.  This 

Individual  12  is  currently  31  years  old.  recombinant  significantly  narrows  the  minimal  region  of 
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Table  1.  Polymorphisms  used  in  this  analysis 


Locus 

Marker 

Type  of 
Polymorphism 

Reference 

D17S250 

MFD15 

CA 

13 

ERBB2 

ERBB2 

RFLP  (Mbol) 

14 

THRAl 

THRAl 

CA 

4 

D17S800 

200ZF4 

CA 

15 

D17S857 

OFl 

CA 

16 

D17S846 

UM8 

CA 

17 

EDH17B1 

HSD.del 

12  bp  deletion 

18 

EDH17B2 

HSD.E 

RFLP  (Bfal) 

19 

D17S855 

248Yg9 

CA 

15 

D17S859 

OF4 

CA 

16 

D17S78 

pl31 

RFLP  {Mspl) 

20 

D17S579 

Mfdl88 

CA 

14 

D17S509 

clbl7.1 

RFLP  (Haeni) 

10 

D17S293 

6C1 

CA 

14 

D17S588 

42D6 

CA 

3 

D17S74 

CMM86 

VNTR  (Hinfi) 

1 

assignment  of  BRCAl  and  should  expedite  efforts  to  clone  this 
gene.  However,  because  of  the  possibility  of  sporadic  cases  of 
breast  and  ovarian  cancers,  no  single  recombinant  can  be  taken 
as  proof  of  BRCAl  placement.  Errors  in  the  assignment  of  the 
location  of  BRCAl  have  been  made  in  the  past  based  on  single 
cases  of  breast  cancer  (e.g.  ref.  10).  In  contrast  to  our  results, 
Smith  and  colleagues  map  BRCAl  to  a  position  centromeric  to 
EDH17B2  (8).  However,  there  was  very  little  evidence  that  the 
family  of  Smith  et  al.  was  linked  to  BRCAl,  and  more  recent 
data  reveal  that  the  case  of  ovarian  cancer  in  this  family  (who 
was  not  typed  at  the  time  of  the  published  report)  did  not  share 
a  common  chromosome  17q  haplotype  with  the  three  cases  of 
early-onset  breast  cancer.  Eiier  she  is  a  sporadic  case  of  ovarian 
cancer  or  the  family  is  unlinked  to  BRCAL  A  recent  report  of 
loss  of  heterozygosity  (LOH),  comparing  DNA  from  130 
sporadic  breast  cancers  with  constitutional  DNA,  also  suggests 
a  placement  of  BRCAl  which  is  inconsistent  with  our  results  (1 1). 
These  investigators  identified  a  minimum  common  region  of  loss 
which  was  proximal  to  D17S776,  which  in  turn  is  reported  to 
be  centromeric  to  EDHB2.  The  relevance  of  such  LOH  studies 
to  mapping  BRCAl  is  currently  unclear. 

The  evidence  that  BRCAl  is  telomeric  to  EDH17B2  in  Family 
2775  is  the  strongest  so  far.  Linkage  to  BRCAl  is  not  in  doubt; 
we  estimate  that  the  probability  that  the  family  is  linked  to  BRCAl 
is  above  99% .  A  more  important  consideration  is  that  individual 
128  may  be  a  sporadic  case.  Our  linkage  model  (described  in 
reference  10)  assigns  a  relative  risk  of  60  for  the  incidence  rate 
of  breast  cancer  in  BRCAl  carriers  versus  nomcarriers  at  the 
age  of  45.  Based  on  pedigree  data  alone  (i.e.,  not  the  DNA 
markers)  this  model  pr^icts  that  individual  128  has  a  94%  chance 
of  having  inherited  a  BRCAl  mutation,  versus  a  6%  chance  of 
being  a  sporadic  case.  Other  prior  probabilities  of  carrying 
BRCAl  mutations  in  this  family  are  estimated  to  be  44%  for 
individual  143,  93%  for  individual  126,  and  49%  for  individual 
12.  Interestingly,  prostate  cancer,  which  was  diagnosed  in 
individual  126,  appears  to  be  increased  in  frequency  among  males 
with  BRCAl  mutations  (12).  Because  of  the  uncertainty  of 
exclusion  attributable  to  a  single  recombinant,  it  is  important  that 
the  present  observations  be  replicated. 

We  are  able  to  support  an  assignment  of  BRCAl  telomeric 
to  the  more  proximal  marker  D17S857  in  a  second  breast— ovary 
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Figure  2.  Pedigree  of  family  101.  Symbols  as  in  figure  1. 


cancer  family,  Family  101  (Fig.  2).  This  family,  which  contains 
two  cases  of  early  onset  breast  cancer  and  two  cases  of  ovarian 
cancer  has  a  96%  probability  of  being  linked  to  BRCAL 
Including  the  present  observations  there  have  now  been  six 
families  reported  in  which  recombinants  which  place  BRCAl 
distal  to  D17S857  (6-8). 


1682  Human  Molecular  Genetics,  1994,  VoL  3,  No.  9 


t 


MATERIAL  AND  METHODS 

Family  2775  was  identified  in  the  mid-westem  United  States  and  is  followed  at 
the  Creighton  University  School  of  Medicine.  The  family  was  initially  identified 
for  linkage  studies  because  of  ten  confirmed  cases  of  breast  cancer  which  appeared 
at  an  average  age  of  43.7  years  (range  31  to  66  years).  Recently,  a  woman  who 
was  previously  diagnosed  with  breast  cancer  at  the  age  of  41  was  found  to  have 
ovarian  cancer.  The  family  therefore  qualifies  as  a  breast —ovarian  cancer  family. 
Other  cancers  in  this  family  include  four  cases  of  colon  cancer,  two  cases  of 
prostate  cancer  and  a  basal  cell  carcinoma.  Family  101  was  identified  at  McGill 
University  in  1990. 

Family  members  were  typed  with  the  polymorphic  markers  listed  in  Table  1 . 
The  polymorphism  HSD.del  is  located  in  the  5'  flanking  sequence  of  EDH17B1 
and  is  defined  by  two  alleles  which  differ  in  size  by  12  base  pairs  (18).  The  HSD.E 
polymorphism  is  a  PCR-based  RFLP  located  in  the  intron  between  exons  I  and 
II  of  EDH17B2  (19).  Polymorphic  markers  for  the  loci  IGFBP4,  keratin  10, 
D17S776,  SCG43,  D17S507  and  GIP  were  also  tested  in  family  2775,  but  were 
not  informative  for  the  critical  recombinant.  LOD  scores  were  calculated  with 
the  MLINK  program  (21).  The  calculation  of  the  posterior  probability  of  linkage 
is  based  on  a  method  described  previously  (22). 
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BRCAI  Mutations  in  Ashkenazi  Jewish  Women 

To  the  Editor. 

The  gene  for  breast  and  ovarian  cancer,  BRCAI,  has 
recently  been  cloned  (Mild  et  al.  1994).  To  date,  we 
have  identified  BRCAI  mutations  in  24  North  American 
families.  In  a  recent  collaborative  report,  the  most  com¬ 
mon  BRCAI  mutation  found  was  a  2-bp  deletion  in 
exon  2  (Shattuck-Eidens  et  al.  1995).  This  mutation 
(185delAG)  was  present  in  6  of  our  24  breast-ovarian 
cancer  families.  All  six  families  are  of  Ashkenazi  Jewish 
origin.  The  families  are  not  known  to  be  related  to  each 
other,  but  haplotype  analyses  suggest  that  these  six  fami¬ 
lies  have  a  common  ancestor  (Simard  et  al.  1994),  Only 
1  of  the  other  18  families  with  BRCAI  mutations  was 
known  to  be  of  Jewish  origin  or  had  founders  with  sur¬ 


names  suggestive  of  Ashkenazi  heritage  (P  <  .0001).  In 
total,  six  of  the  seven  Ashkenazi  families  were  found  to 
carry  the  185delAG  mutation.  Three  other  Ashkenazi 
families  with  this  mutation  have  been  reported  by 
Struewing  et  al.  (1995;  in  this  issue).  If  the  majority 
of  hereditary  breast-ovarv’’  cancer  families  in  any  ethnic 
subgroup  can  be  attributed  to  a  small  number  of  muta¬ 
tions,  our  effoiTs  to  provide  DNA-based  predictive  test¬ 
ing  will  be  greatly  enhanced. 
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